Run on:



nucleic search, using sw model




:58 ; Search time 547 Seconds 
(without alignments) 
7647.805 Million cell updates/sec 



Title: US-10-722-939-1-T39977_39901-40500
Perfect score: 600
Sequence: 1 tacaatagaccctgcttctt...aataagtgacagagctgtga 600

Scoring table: IDENTITY_NUC
Gapop 10.0 , Gapext 1.0



Searched: 5244920 seqs, 3486124231 residues

Total number of hits satisfying chosen parameters: 10489840

Minimum DB seq length: 0
Maximum DB seq length: 2000000000



Post-processing : 



Minimum Match 0% 

Maximum Match 100% 

Listing first 100 summaries 
N_Geneseq_8 : *
geneseqn1980s : *
geneseqn1990s : *
geneseqn2000s : * 
geneseqn2001as : * 
geneseqn2001bs : * 
geneseqn2002as : * 
geneseqn2002bs : * 
geneseqn2003as : * 
geneseqn2003bs : *
geneseqn2003cs : * 
geneseqn2003ds : * 
geneseqn2004as : * 
geneseqn2004bs : * 
geneseqn2005s : * 
geneseqn2006s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
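
As a cross-check on the run header above, the reported throughput can be recomputed from the query length, the database size, and the search time. The sketch below assumes that one "cell update" is one query-residue by database-residue comparison and that both strands of every database entry were searched. The report states neither point, although the hit count (twice the number of sequences searched) is consistent with the second. All function and variable names are illustrative.

```python
# Figures copied from the run header of this report.
QUERY_LENGTH = 600            # perfect score under an identity matrix
DB_SEQS = 5_244_920           # "Searched: ... seqs"
DB_RESIDUES = 3_486_124_231   # "Searched: ... residues"
SEARCH_SECONDS = 547          # "Search time 547 Seconds"
STRANDS = 2                   # assumption: forward and complement searched


def million_cell_updates_per_sec(query_len, db_residues, seconds, strands=STRANDS):
    """Return dynamic-programming cells filled per second, in millions."""
    cells = query_len * db_residues * strands
    return cells / seconds / 1e6


if __name__ == "__main__":
    rate = million_cell_updates_per_sec(QUERY_LENGTH, DB_RESIDUES, SEARCH_SECONDS)
    print(f"{rate:.3f} Million cell updates/sec")
    print(f"{DB_SEQS * STRANDS} candidate hits (sequences x strands)")
```

Under these assumptions the computed rate agrees with the header figure to three decimal places, which supports reading the header's hit total as sequences times strands.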
ALIGNMENTS



RESULT 1 
ADO79402 

ID ADO79402 standard; DNA; 76600 BP. 
XX 

AC ADO79402; 
XX 

DT 26-AUG-2004 (first entry) 
XX 

DE DLG1 region, SEQ ID 1.
XX
KW Cytostatic; Gene therapy; breast cancer; human; DLG1; KIAA0783; DPF3;
KW CENPC1; gene; ds; SNP; single nucleotide polymorphism;
KW synapse-associated protein 97; hdlg; SAP97; chromosome 3q29.

XX 

OS Homo sapiens. 



XX 

FH Key Location/Qualifiers 

FT variation 133 

FT /*tag= a 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 7938 
FT /*tag= b 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 8873 
FT /*tag= c 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a G/A SNP" 

FT variation 13221 
FT /*tag= d 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 17288 
FT /*tag= e 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 25732 
FT /*tag= f 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a G/A SNP" 

FT variation 26923 
FT /*tag= g 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a C/G SNP" 

FT variation 39977 
FT /*tag= h 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 41284 
FT /*tag= i 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/A SNP" 

FT variation 41410 
FT /*tag= j 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a A/C SNP" 

FT variation 41477 
FT /*tag= k 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 41514 

FT /*tag= l

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a G/A SNP" 

FT variation 42606 
FT /*tag= m 

FT /standard_name= "Single nucleotide polymorphism" 



FT /note= "This SNP is described as a G/A SNP" 

FT variation 42742 

FT /*tag= n 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a A/C SNP" 

FT variation 59515 
FT /*tag= o 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a G/A SNP" 

FT variation 59808 
FT /*tag= p 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 60265

FT /*tag= q 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a C/G SNP" 

FT variation 67152 

FT /*tag= r 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 68332 
FT /*tag= s 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 71128 
FT /*tag= t 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a T/C SNP" 

FT variation 76427 
FT /*tag= u 

FT /standard_name= "Single nucleotide polymorphism" 

FT /note= "This SNP is described as a G/A SNP" 

XX 



PN WO2004047514-A2.
XX
PD 10-JUN-2004.
XX
PF 25-NOV-2003; 2003WO-US037943.
XX
PR 25-NOV-2002; 2002US-0429136P.
PR 24-JUL-2003; 2003US-0490234P.
XX 

PA (SEQU-) SEQUENOM INC. 
XX 

PI Roth RB, Nelson MR, Braun A, Kammerer SM, Reneland R; 
XX 

DR WPI; 2004-441037/41. 
XX 

PT Identifying a subject at risk of breast cancer by detecting the presence 

PT of polymorphic variations in the DLG1, KIAA0783, DPF3 or CENPC1 regions

PT which are associated with breast cancer in a nucleic acid sample from a 

PT subject. 
XX 

PS Claim 24; Fig 1; 227pp; English. 
XX 

CC The present invention relates to a method for identifying a subject at 



CC risk of breast cancer. The method comprising detecting the presence or 

CC absence of one or more polymorphic variations associated with breast 

CC cancer in a nucleic acid sample from a subject. The nucleic acid sample 

CC comprises the DLG1 region (ADO79402), KIAA0783 region (ADO79403), DPF3
CC region (ADO79404) or CENPC1 region (ADO79405). The gene DLG1 (discs,
CC large homolog 1 (Drosophila)) is also known as synapse-associated protein
CC 97, hdlg or SAP97. DLG1 has been mapped to chromosomal position 3q29. The
CC gene KIAA0783 is also known as PHF14 and PHD finger protein 14. KIAA0783
CC has been mapped to chromosomal position 7p21.3. The KIAA0783 protein is a
CC novel gene with unknown function, however, being a zinc finger protein,
CC it is likely to be a transcription factor. The gene DPF3 (D4, zinc and
CC double PHD fingers, family 3) is also known as CERD4, cer-d4, FLJ14079
CC and 2810403B03Rik. DPF3 is a Rho family guanine-nucleotide exchange
CC factor. DPF3 has been mapped to chromosomal position 14q24.3-q31.1. The
CC gene CENPC1 (centromere protein C1) is also known as Centromere
CC autoantigen C1. CENPC1 has been mapped to chromosomal position 4q12-
CC q13.3. CENPC1 is a centromere autoantigen and a component of the inner
CC kinetochore plate. The CENPC1 protein is required for maintaining proper

CC kinetochore size and a timely transition to anaphase. The method is 

CC useful for identifying a subject at risk of breast cancer, for early 

CC diagnosis, prevention and treatment of breast cancer, to analyze and 

CC predict a response to a breast cancer treatment, and in clinical drug 

CC trials. 
XX 

SQ Sequence 76600 BP; 25920 A; 14119 C; 13638 G; 22902 T; 0 U; 21 Other; 



Query Match 100.0%; Score 600; DB 12; Length 76600; 

Best Local Similarity 99.8%; Pred. No. 2.4e-84; 

Matches 599; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy       1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   39901 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 39960

Qy      61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAA 120
           ||||||||||||||||:|||||||||||||||||||||||||||||||||||||||||||
Db   39961 ATCTACTTCATTTACTYGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAA 40020

Qy     121 AAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTT 180
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   40021 AAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTT 40080

Qy     181 CTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGA 240
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   40081 CTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGA 40140

Qy     241 AGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAG 300
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   40141 AGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAG 40200

Qy     301 GAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTAAATAATAATTACAGCT 360
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   40201 GAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTAAATAATAATTACAGCT 40260

Qy     361 ACCATTTGCATACTTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTTATAAAA 420
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   40261 ACCATTTGCATACTTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTTATAAAA 40320

Qy     421 ATAGCAAACATTTATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTTAACCCT 480
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   40321 ATAGCAAACATTTATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTTAACCCT 40380

Qy     481 CATGACACCTGTAAGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAAACTGAG 540
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   40381 CATGACACCTGTAAGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAAACTGAG 40440

Qy     541 GCACAAAGAATGTAAATAACTTTCCTAAGGCCACCCAGATAATAAGTGACAGAGCTGTGA 600
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   40441 GCACAAAGAATGTAAATAACTTTCCTAAGGCCACCCAGATAATAAGTGACAGAGCTGTGA 40500
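
The single "Conservative" position in Result 1 falls where the database entry carries the IUPAC ambiguity code Y (C or T) opposite a T in the query, at database coordinate 39977, the position annotated as a T/C SNP. The sketch below shows one way such a column could be classified. It assumes that "conservative" means the two symbols share at least one possible base; the report does not define the term, so treat that rule and all names here as illustrative.

```python
# Standard IUPAC nucleotide codes mapped to the bases they can stand for.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def classify(q, d):
    """Return '|' for identity, ':' for a compatible ambiguity code, ' ' otherwise."""
    if q == d:
        return "|"
    if set(IUPAC[q]) & set(IUPAC[d]):
        return ":"  # assumed meaning of "Conservative" in this report
    return " "


def tally(qy, db):
    """Count matches, conservative pairs, and mismatches in an ungapped row."""
    marks = "".join(classify(q, d) for q, d in zip(qy, db))
    counts = {
        "matches": marks.count("|"),
        "conservative": marks.count(":"),
        "mismatches": marks.count(" "),
    }
    return counts, marks


if __name__ == "__main__":
    # Row Qy 61-120 / Db 39961-40020 of Result 1.
    qy = "ATCTACTTCATTTACTTGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAA"
    db = "ATCTACTTCATTTACTYGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAA"
    counts, marks = tally(qy, db)
    print(counts)
    print("conservative column at Db position", 39961 + marks.index(":"))
```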



RESULT 2 
AEE84582/C 

ID AEE84582 standard; DNA; 2778 BP. 
XX 

AC AEE84582; 
XX 

DT 23-FEB-2006 (first entry) 
XX 

DE Discs large homolog 1 (DLG1) gene.
XX
KW ds; gene; Discs large homolog 1; DLG1; pharmaceutical; diagnostic;

KW screening; apoptosis; cancer; neurodegenerative disease; breast tumor; 

KW cytostatic; endocrine disease; gynecology and obstetrics; neoplasm; 

KW lung tumor; respiratory disease; Alzheimers disease; neuroprotective; 

KW nootropic; degeneration; neurological disease. 

XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT CDS 1..2778
FT /*tag= a
FT /product= "Discs large homolog 1 (DLG1)"

XX 

PN WO2005118631-A1. 
XX 

PD 15-DEC-2005. 
XX 

PF 02-JUN-2005; 2005WO-JP010538.
XX
PR 03-JUN-2004; 2004JP-00165407.
XX 

PA (TAKE ) TAKEDA PHARM CO LTD. 

XX 

PI Sunahara E, Ishii T; 
XX 

DR WPI; 2006-047512/05. 
DR P-PSDB; AEE84581. 

XX 

PT Complex useful for screening compounds utilized for promoting or 

PT inhibiting apoptosis for treating cancer or neurodegenerative diseases, 

PT comprises semaphorin 4B protein and discs large homolog 1 and/or 3 

PT protein. 

XX 



PS Disclosure; SEQ ID NO 28; 140pp; Japanese. 

XX 

CC The invention relates to a complex (I) comprising semaphorin 4B (SEMA4B)
CC protein (P1) having an amino acid sequence which is similar or
CC substantially similar to that of (SEQ ID NO:26; see AEE84580), and discs
CC large homolog 1 (DLG1) and/or DLG3 (P2) having an amino acid sequence
CC which is similar or substantially similar to that of (SEQ ID NO:27 or SEQ
CC ID NO:29; see AEE84581 and AEE84583). (P1) of (I) further comprises one of
CC four fully amino acid sequences (SEQ ID No. 1, 4, 7 or 10; see AEE84555,
CC AEE84558, AEE84561 and AEE84564). Also included are: an antibody (II)
CC with respect to (I), or that inhibits or promotes dissociation or
CC formation of (I); a pharmaceutical (III) comprising (II); a diagnostic
CC (IV) comprising (II); screening (M1) compound or its salt that inhibits
CC or promotes binding of (P1) with (P2) or dissociation of (I), involves
CC utilizing (P1) or (P2); a kit for carrying out (M1), comprising (P1) or
CC (P2); a compound (V) or its salt that inhibits binding of (P1) with (P2)
CC or promotes dissociation of (I); a promoter (VI) of apoptosis of cancer
CC cell, the cancer proliferation inhibitor (VII) or preventive/therapeutic
CC agent of cancer (VIII), comprising (V) or its salt; a compound (IX) or
CC its salt that promotes binding of (P1) with (P2) or inhibits dissociation
CC of (I); an inhibitor (X) of apoptosis of neuron, or
CC preventive/therapeutic agent (XI) of neurodegenerative diseases,
CC comprising (IX) or its salt; promoting (M2) apoptosis or inhibiting (M3)
CC cancer cell proliferation, involves inhibiting binding of (P1) and (P2)
CC or promoting dissociation of (I); preventing or treating (M4) cancer,
CC involves inhibiting binding of (P1) and (P2) or promoting dissociation of
CC (I); inhibiting (M5) apoptosis of neuron or preventing or treating (M6)
CC neurodegenerative diseases, involves promoting binding of (P1) and (P2)
CC or inhibiting dissociation of (I); screening compound or its salt for
CC preventing or treating cancer or neurodegenerative diseases, involves
CC utilizing (P2), its partial peptide or its salt; the use of a substance
CC (XII) or its salt that inhibits binding of (P1) and (P2) or promotes
CC dissociation of (I) for producing promoter of apoptosis of cancer cell,
CC cancer cell proliferation inhibitor or preventive/therapeutic agent of
CC cancer; and the use of a substance (XIII) or its salt that promotes
CC binding of (P1) and (P2) or inhibits dissociation of (I) for producing
CC inhibitor of apoptosis of neuron or preventive/therapeutic agent of
CC neurodegenerative diseases. (III) is useful as promoter of apoptosis of
CC cancer cell, cancer cell proliferation inhibitor or
CC preventive/therapeutic agent of cancer, or as apoptosis inhibitor of
CC neuron or preventive/therapeutic agent of neurodegenerative diseases.
CC (IV) is useful for diagnosing cancer or neurodegenerative diseases. (VI),
CC (VII) or (VIII) is useful for promoting apoptosis of cancer cell,
CC inhibiting cancer proliferation, or preventing or treating cancer. (X) or
CC (XI) is useful for inhibiting apoptosis of neuron, or preventing or
CC treating neurodegenerative diseases. (M2) is useful for promoting
CC apoptosis. (M3) is useful for inhibiting cancer cell proliferation. (M4)
CC is useful for preventing or treating cancer. (M5) is useful for
CC inhibiting apoptosis of neurons. (M6) is useful for preventing or
CC treating neurodegenerative diseases. (XII) is useful for producing
CC promoter of apoptosis of cancer cell, cancer cell proliferation inhibitor
CC or preventive/therapeutic agent of cancer. (XIII) is useful for producing
CC inhibitor of apoptosis of neuron or preventive/therapeutic agent of
CC neurodegenerative diseases. (I) is useful for screening compounds
CC utilized for promoting or inhibiting apoptosis for treating cancer (e.g.
CC breast or lung cancer) or neurodegenerative diseases (e.g. Alzheimer's
CC disease). (I) enables screening of promoter or inhibitor of apoptosis.
CC The present sequence represents Discs large homolog 1 (DLG1) gene.
XX 

SQ Sequence 2778 BP; 930 A; 545 C; 605 G; 698 T; 0 U; 0 Other; 

Query Match 16.3%; Score 98; DB 15; Length 2778;
Best Local Similarity 100.0%; Pred. No. 1.5e-06;
Matches 98; Conservative 0; Mismatches 0; Indels 0; Gaps 0;

Qy       1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     909 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 850

Qy      61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
           ||||||||||||||||||||||||||||||||||||||
Db     849 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 812
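
In Result 2 the Db coordinates run downward (909 to 812) and the accession carries a "/C" suffix, which suggests the query aligned to the complementary strand of the database entry. That reading is an assumption about the report's convention, not something the report states. The sketch below, with illustrative names, shows how the summary figures relate: Query Match as score over perfect score, the aligned length from the coordinate pair, and the reverse complement that would give the forward-strand reading of a query segment.

```python
# Translation table for unambiguous DNA bases only.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def query_match_percent(score, perfect_score):
    """Score as a percentage of the perfect (self-match) score, one decimal."""
    return round(100.0 * score / perfect_score, 1)


def aligned_length(db_start, db_end):
    """Number of database positions covered, whichever way the coordinates run."""
    return abs(db_start - db_end) + 1


def is_complement_hit(db_start, db_end):
    """Descending coordinates are taken here to mean a complement-strand hit."""
    return db_start > db_end


if __name__ == "__main__":
    # Figures from Result 2: Score 98, perfect score 600, Db 909 down to 812.
    print(query_match_percent(98, 600))
    print(aligned_length(909, 812), is_complement_hit(909, 812))
    print(reverse_complement("TACAATAGAC"))  # first ten query bases
```

The same arithmetic applies to Results 3 through 7, which report the same score over different coordinate ranges.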



RESULT 3 
AED21168/C 

ID AED21168 standard; DNA; 2980 BP. 
XX 

AC AED21168; 
XX 

DT 15-DEC-2005 (first entry) 
XX 

DE Human Dlg (disks large) DNA sequence.
XX
KW secreted frizzled-related protein; tumor; cytostatic; neoplasm;
KW drug screening; skin tumor; lymphoma; hematological disease;
KW immune disorder; disks large; Dlg; ds; gene.
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 
FT CDS 189..2903

FT /*tag= a 

FT /product= "disks large protein" 

XX 

PN WO2005094887-A1. 
XX 

PD 13-OCT-2005. 
XX 

PF 30-MAR-2005; 2005WO-JP006163.
XX
PR 31-MAR-2004; 2004JP-00106315.
XX 

PA (DAUC ) DAIICHI PHARM CO LTD. 

XX 

PI Akiyama T, Ishidao T, Aiba T; 

XX 

DR WPI; 2005-725415/74. 
DR P-PSDB; AED21169. 
XX 

PT Enhancer of expression and/or function of secreted frizzled-related
PT protein for inhibiting tumorigenesis, comprises a compound capable of
PT increasing expression and/or function of discs large.
XX 



PS Disclosure; SEQ ID NO 1; 54pp; Japanese. 

XX 

CC The new invention relates to the finding that tumor formation and the
CC transcription of secreted frizzled-related protein (sFRP) gene is lowered
CC in a Dlg-knockout mouse. Specifically claimed is an enhancer (I) of the
CC expression and/or function of secreted frizzled-related protein (sFRP),
CC comprising a compound capable of increasing the expression and/or
CC function of Dlg (discs large). Also claimed are a tumorigenesis inhibitor
CC (II) containing (I); an agent (III) for preventing or treating tumor,
CC containing (I); increasing (M1) the expression and/or function of sFRP,
CC comprising inducing the expression and/or function of Dlg; a non-human
CC mammal (IV) having one of the deleted Dlg allele; a cell (V) derived from
CC a non-human mammal having a deleted Dlg allele; and detecting tumor
CC tissue or cells, comprising measuring the expression and/or function of
CC Dlg in a sample of the cell or tissue, comparing the expression and/or
CC function with a healthy tissue or a cell and detecting whether the
CC expression and/or function is reduced. In (I), the compound that
CC increases the expression and/or function of Dlg is chosen from a Dlg gene
CC or recombinant vector containing the Dlg gene. The sFRP is sFRP2. (M1)
CC involves using a Dlg gene or recombinant vector containing the Dlg gene.
CC (I) or (M1) are useful for inhibiting tumorigenesis and for preventing or
CC treating tumor. (IV) or (V) is useful for identifying a compound capable
CC of increasing the expression and/or function of Dlg or sFRP, or
CC inhibiting tumorigenesis. (II) and (III) are useful for treating a tumor
CC such as skin tumor or lymphoma. (I) effectively induces the expression
CC and/or function of sFRP. The present sequence is human disks large DNA
CC sequence.

XX 

SQ Sequence 2980 BP; 949 A; 595 C; 672 G; 764 T; 0 U; 0 Other; 

Query Match 16.3%; Score 98; DB 14; Length 2980; 

Best Local Similarity 100.0%; Pred. No. 1.5e-06; 

Matches 98; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy       1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1097 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 1038

Qy      61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
           ||||||||||||||||||||||||||||||||||||||
Db    1037 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 1000



RESULT 4 
AEF75023/C 

ID AEF75023 standard; DNA; 2980 BP. 
XX 

AC AEF75023; 
XX 

DT 06-APR-2006 (first entry) 
XX 

DE Human polynucleotide #537. 
XX 

KW Diagnosis; gene regulation; gene expression; 

KW post traumatic stress disorder; psychiatric disorder; tranquilizer; gene; 

KW ds. 

XX 



OS Homo sapiens. 
XX 

PN WO2006013561-A2. 
XX 

PD 09-FEB-2006. 

XX 

PF 02-AUG-2005; 2005WO-IL000824.
XX
PR 02-AUG-2004; 2004US-0592408P.
XX 

PA (YISS ) YISSUM RES DEV CO HEBREW UNIV JERUSALEM. 

PA (HADA-) HADASIT MEDICAL RES SERVICES & DEV LTD. 
XX 

PI Segman R, Shalev A, Goltser T, Friedman N, Shefi N, Kaminski N; 
XX 

DR WPI; 2006-145797/15. 

XX 

PT New kit comprising 10 and no more than 574 polynucleotides capable of 

PT specifically binding at least one specific polynucleotide sequence, 

PT useful for determining predisposition of a subject to develop PTSD, or 

PT for diagnosing PTSD. 
XX 

PS Claim 1; SEQ ID NO 537; 157pp; English. 
XX 

CC The invention relates to a kit for determining predisposition of a 

CC subject to developing post-traumatic stress disorder (PTSD) comprising at

CC least 10 and no more than 574 polynucleotides, where each of the 

CC polynucleotides is capable of specifically binding at least one specific 

CC polynucleotide sequence. The invention also relates to a kit for 

CC diagnosing PTSD in a subject, agents for the manufacture of the kits 

CC cited comprising the polynucleotides cited, and a microarray comprising 

CC at least 10 and no more than 904 oligonucleotides where each of the 

CC oligonucleotides is capable of specifically binding at least one specific 

CC polynucleotide sequence. The kit comprises each of the polynucleotides 

CC selected from an oligonucleotide molecule, a cDNA molecule, a genomic 

CC molecule and an RNA molecule. Each of the polynucleotides is at least 10 

CC and no more than 50 nucleic acids in length. Each of the polynucleotides 

CC is bound to a solid support. The kit also comprises at least one reagent 

CC suitable for detecting hybridization of the polynucleotides and at least 

CC one RNA transcript. The kit further comprises packaging materials 

CC packaging the at least one reagent and instructions for using the kit in 

CC determining predisposition of the subject to developing PTSD, or for 

CC diagnosing the disease. The microarray comprises oligonucleotides of at 

CC least 10 and no more than 40 nucleic acids in length. The agent is

CC capable of regulating an expression level of at least one gene as a 

CC pharmaceutical or for the manufacture of a medicament identified for 

CC preventing PTSD. The kit is useful for determining predisposition of a 

CC subject to developing PTSD or for diagnosing PTSD. This sequence 

CC represents a human polynucleotide of the invention. Note: The sequence 

CC data for this patent did not form part of the printed specification, but 

CC was obtained in electronic format directly from WIPO at 

CC ftp.wipo.int/pub/published_pct_sequences.

XX 

SQ Sequence 2980 BP; 949 A; 595 C; 672 G; 764 T; 0 U; 0 Other; 



Query Match 16.3%; Score 98; DB 15; Length 2980; 

Best Local Similarity 100.0%; Pred. No. 1.5e-06; 



Matches 98; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy       1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1097 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 1038

Qy      61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
           ||||||||||||||||||||||||||||||||||||||
Db    1037 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 1000



RESULT 5
ACC85071/c
ID ACC85071 standard; DNA; 3046 BP. 
XX 

AC ACC85071; 

XX 

DT 13-OCT-2003 (first entry) 
XX 

DE Human MBCAT polypeptide encoding DNA. 
XX 

KW MBCAT; beta-catenin; cytostatic; gene therapy; cancer; human; gene; ds.
XX 

OS Homo sapiens. 
XX 

FH Key Location/Qualifiers 

FT CDS 189..2969

FT /*tag= a 

FT /product= "MBCAT" 

XX 

PN WO2003052068-A2. 

XX 

PD 26-JUN-2003. 
XX 

PF 12-DEC-2002; 2002WO-US039796.
XX
PR 13-DEC-2001; 2001US-0340213P.
PR 13-DEC-2001; 2001US-0340314P.
PR 13-DEC-2001; 2001US-0340322P.
PR 15-FEB-2002; 2002US-0357502P.
XX 

PA (EXEL-) EXELIXIS INC. 
XX 

PI Costa MA, Gendreau SB, Dora EG, Nicoll M; 
XX 

DR WPI; 2003-533010/50. 
DR P-PSDB; ABR82220. 
XX 

PT Identifying a candidate beta-catenin pathway modulating agent for 

PT diagnosing or treating cancer by detecting a test agent-biased activity

PT of the assay system comprising a purified MBCAT polypeptide or nucleic 

PT acid. 

XX 

PS Example; Page 49-51; 81pp; English. 
XX 

CC The invention relates to genes that modify beta-catenin pathway and to 
CC the identification of human MBCAT (modifiers of beta-catenin) genes. The 



CC MBCAT polypeptides are therapeutic targets for disorders associated with 

CC beta-catenin function and are useful for manufacturing a medicament for 

CC diagnosing or treating breast, colon, lung or ovary cancer. The present 

CC sequence represents a human MBCAT polypeptide encoding DNA 
XX 

SQ Sequence 3046 BP; 975 A; 612 C; 681 G; 778 T; 0 U; 0 Other; 

Query Match 16.3%; Score 98; DB 9; Length 3046; 

Best Local Similarity 100.0%; Pred. No. 1.5e-06; 

Matches 98; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy       1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1097 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 1038

Qy      61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
           ||||||||||||||||||||||||||||||||||||||
Db    1037 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 1000



RESULT 6
ADK11476/c

ID ADK11476 standard; DNA; 3046 BP. 
XX 

AC ADK11476; 
XX 

DT 06-MAY-2004 (first entry) 
XX 

DE Human discs large (Drosophila) homolog 1 gene. 
XX 

KW ds; gene; cytostatic; cardiovascular; immunosuppressive; nephrotropic; 

KW antirheumatic; antiarthritic; dermatological; antipsoriatic;

KW antiinflammatory; fungicide; gene therapy; Drosophila; diagnosis; 

KW cardiovascular disorder; autoimmune disease; glomerulonephritis; 

KW rheumatoid arthritis; dermatological disorder; psoriasis; 

KW inflammatory disorder; malaria; emphysema; alopecia. 

XX 

OS Homo sapiens. 
XX 

PN WO2003040301-A2. 
XX 

PD 15-MAY-2003. 
XX 

PF 23-OCT-2002; 2002WO-GB004780.
XX
PR 05-NOV-2001; 2001GB-00026506.
PR 27-NOV-2001; 2001GB-00028384.
PR 11-FEB-2002; 2002GB-00003185.
XX 

PA (CYCL-) CYCLACEL LTD. 
XX 

PI Deak P, Frenz L, Glover D, Midgley C; 
XX 

DR WPI; 2003-441540/41. 

DR P-PSDB; ADK11477. 
XX 

PT New Drosophila polypeptides and polynucleotides, useful for diagnosing, 



PT preventing and/or treating disorders, such as cancer, glomerulonephritis, 

PT rheumatoid arthritis, psoriasis, malaria, emphysema and alopecia. 

XX 

PS Example 28; Page 225; 265pp; English. 
XX 

CC The invention relates to novel Drosophila species DNA sequences and their 

CC encoded proteins with their corresponding human homologues. The proteins

CC or their encoding polynucleotides are useful in a method of prevention, 

CC treatment or diagnosis of a disease in an individual, and used to 

CC identify a substance capable of binding to the polypeptide or modulating 

CC the function of the polypeptide comprising incubating the polypeptide 

CC with a candidate substance and determining whether the substance binds to 

CC the polypeptide. The compositions are administered to an individual in 

CC need of such treatment. The method of diagnosis, in which the presence or 

CC absence of a polynucleotide is detected in a biological sample, comprises 

CC bringing the biological sample containing the nucleic acid such as DNA or

CC RNA into contact with a probe comprising a fragment of at least 15 

CC nucleotides of the polynucleotide, and detecting any duplex formed 

CC between the probe and nucleic acid in the sample. The method also 

CC comprises providing an antibody capable of binding to the polypeptide, 

CC incubating a biological sample with the antibody to allow the formation 

CC of an antibody-antigen complex, and determining whether antibody-antigen

CC complex comprising the antibody is formed. The disease comprises a 

CC proliferative disease such as cancer. The antibody or identified 

CC substance is also useful in inhibiting the function of a polypeptide 

CC and/or regulating a cell division cycle function. The diseases also 

CC include cardiovascular disorders, autoimmune diseases such as 

CC glomerulonephritis and rheumatoid arthritis, and dermatological disorders 

CC such as psoriasis, inflammatory, fungal, and parasitic disorders such as 

CC malaria, emphysema and alopecia. This sequence represents a human homolog 

CC gene for one of the Drosophila genes of the invention. 

XX 

SQ Sequence 3046 BP; 975 A; 612 C; 681 G; 778 T; 0 U; 0 Other; 

Query Match 16.3%; Score 98; DB 10; Length 3046;

Best Local Similarity 100.0%; Pred. No. 1.5e-06; 

Matches 98; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy       1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1097 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 1038

Qy      61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
           ||||||||||||||||||||||||||||||||||||||
Db    1037 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 1000



RESULT 7 
ADR83538/C 

ID ADR83538 standard; DNA; 3046 BP. 
XX 

AC ADR83538; 
XX 

DT 02-DEC-2004 (first entry) 
XX 

DE Human discs large homolog 1 DNA, target gene of miRNA. 
XX 



KW human; ds; miRNA; microRNA; ontogenesis; cell therapy; cancer; 

KW immune disease; nerve disorder; amyotrophic lateral sclerosis; 

KW Parkinson's disease; Alzheimer's disease; inflammatory disease;

KW siRNA silencing precursor; cytostatic; immunosuppressive; nootropic; 

KW neuroprotective; antiinflammatory; immunotherapy; discs large homolog 1. 

XX 

OS Homo sapiens. 
XX 

PN WO2004076622-A2. 
XX 

PD 10-SEP-2004.
XX
PF 10-FEB-2004; 2004WO-JP001433.
XX
PR 10-FEB-2003; 2003US-0445829P.
XX 

PA (NAAD-) NAT INST ADVANCED IND SCI & TECHNOLOGY. 

XX 

PI Taira K, Kawasaki H; 

XX 

DR WPI; 2004-653393/63. 
XX 

PT Modulating expression of a target gene in a cell, for treating cancer, an 

PT immune disease, or a nerve disorder, comprises introducing into the cell 

PT a polynucleotide that forms a duplex region with an mRNA transcribed from 

PT the target gene. 
XX 

PS Claim 9; SEQ ID NO 440; 865pp; English. 
XX 

CC This invention relates to a novel method for modulating the expression of 

CC a target gene in a cell. Specifically, it refers to the introduction into 

CC a cell of a polynucleotide that forms a duplex region with an mRNA 

CC transcribed from the target gene, where the duplex region comprises a 

CC mammalian miRNA target region i.e. a non-coding microRNA (miRNA) that 

CC regulates mRNA at a post-transcriptional level. The present invention 

CC describes a method for controlling ontogenesis of a mammal, function of a 

CC mammalian cell, differentiation of a mammalian cell or viability of a 

CC mammalian cell in the post-transcriptional phase, which comprises 

CC introducing a plasmid vector comprising a promoter and nucleic acid 

CC molecule expressing an miRNA or siRNA silencing precursor to the miRNA. 

CC Accordingly, it provides a cell therapy method for treating cancer, 

CC immune disease, nerve disorder (e.g. amyotrophic lateral sclerosis, 

CC Parkinson's disease, or Alzheimer's disease) or an inflammatory disease 

CC by introducing into the cell the miRNA, siRNA silencing precursor to the 

CC miRNA or the plasmid vector. As such, they can be developed into 

CC pharmaceutical compositions that exhibit cytostatic, immunosuppressive, 

CC nootropic, neuroprotective and antiinflammatory activities and hence can 

CC be used for immunotherapy. This polynucleotide sequence is a human target 

CC gene whose expression is modulated by miRNAs of the invention. 

XX 

SQ Sequence 3046 BP; 975 A; 612 C; 681 G; 778 T; 0 U; 0 Other; 

Query Match 16.3%; Score 98; DB 13; Length 3046; 

Best Local Similarity 100.0%; Pred. No. 1.5e-06;

Matches 98; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy       1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
           ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    1097 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 1038

Qy      61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
           ||||||||||||||||||||||||||||||||||||||
Db    1037 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 1000



RESULT 8 
ADU05806/C 

ID ADU05806 standard; DNA; 3046 BP. 
XX 

AC ADU05806; 
XX 

DT 27-JAN-2005 (first entry) 
XX 

DE Novel bronchial cancer-associated human gene SeqID28. 
XX 

KW bronchial cancer; cytostatic; tumour-associated protein; 

KW cancer detection; metastasis; tumour; gene; ds; human. 
XX 

OS Homo sapiens. 
XX 

PN DE10316701-A1. 
XX 

PD 04-NOV-2004. 
XX 

PF 09-APR-2003; 2003DE-01016701 . 
XX 

PR 09-APR-2003; 2003DE-01016701 . 

XX 

PA (HINZ/) HINZMANN B. 

PA (HERM/) HERMANN K. 

PA (CAST/) HEIDEN CASTANOS-VELEZ E. 

XX 

PI Mennerich D, Bruemmendorf T, Heiden E, Hermann K, Kinnemann H; 

PI Li X, Roepcke S, Staub E, Hinzmann B, Rosenthal A, Pilarsky C; 
XX 

DR WPI; 2004-786403/78. 

DR P-PSDB; ADU06293. 
XX 

PT New nucleic acid, and derived proteins, useful for diagnosis of bronchial 

PT cancer and in screening for therapeutic and diagnostic agents. 

XX 

PS Claim 1; SEQ ID NO 28; 1381pp; German. 
XX 

CC This invention relates to a novel isolated nucleic acid associated with 

CC bronchial cancer comprising 489 defined sequences given in the 

CC specification. The invention may be useful for the production of 

CC compounds with a cytostatic activity through the inhibition of expression 

CC or activity of tumour-associated proteins. The novel DNA sequences and 

CC the proteins/peptides encoded by them are used for detecting bronchial 

CC cancer or determining the risk of developing it and to screen for 

CC specific binding partners of the DNA or protein sequences, where the 

CC binding partners are potentially useful as agents for treating or 

CC diagnosing bronchial cancer. The DNA or protein sequences can also be 

CC used for prognosis, detection of metastases and for secondary treatment 
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/EMC_Celerra_SIDS3/ptodata/2/pubpna/USllA_PUBCOMB. seq: * 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US11B_PUBCOMB.seq:* 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US11C_PUBCOMB.seq:* 
/EMC_Celerra_SIDS3/ptodata/2/pubpna/US11D_PUBCOMB.seq:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



                 %
Result          Query
  No.   Score   Match  Length  DB  ID                  Description



1 599.6 100.0 76600 10 US-10-722-939-1 
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ALIGNMENTS 



RESULT 1 
US-10-722-939-1 

; Sequence 1, Application US/10722939 

; Publication No. US20050214771A1 

; GENERAL INFORMATION: 

; APPLICANT: ROTH, RICHARD B. 

; APPLICANT: NELSON, MATTHEW ROBERTS 

; APPLICANT: KAMMERER, STEFAN M. 

; APPLICANT: BRAUN, ANDREAS 

; APPLICANT: RENELAND, RIKARD 



; TITLE OF INVENTION: METHODS FOR IDENTIFYING RISK OF BREAST CANCER AND 
TREATMENTS 

; TITLE OF INVENTION: THEREOF 

; FILE REFERENCE: SEQ-4071-UT 

; CURRENT APPLICATION NUMBER: US/10/722,939 

; CURRENT FILING DATE: 2003-11-26 

; PRIOR APPLICATION NUMBER: 60/429,136 

; PRIOR FILING DATE: 2002-11-25 

; PRIOR APPLICATION NUMBER: 60/490,234 

; PRIOR FILING DATE: 2003-07-23 

; NUMBER OF SEQ ID NOS : 634 

; SOFTWARE: PatentIn version 3.2 

; SEQ ID NO 1 

LENGTH: 76600 

TYPE: DNA 
; ORGANISM: Homo sapiens 
US-10-722-939-1 



Query Match 100.0%; Score 600; DB 10; Length 76600; 

Best Local Similarity 99.8%; Pred. No. 1.2e-90; 

Matches 599; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
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ATT'TAr^TTT^ A'l'T'r AO'PV/^T'A AT'AT'A^ A/^TT* AT'T'/** A/^/^/^T"/^ A^^ A A A A A A T"T> A A AT'A'P'T' A A 

AlL.lAd lUAx 1 lAv^l JLVjiAAlAiAL.AVjiUAl ioAUwU i vyAvjAAAAv^AAI 1 AAA! AX lAA 


>i A r\ 0 
4U0z0 


Qy 121 AAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTT 180 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 40021 AAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTT 40080 

Qy 181 CTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGA 240 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 40081 CTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGA 40140 

Qy 241 AGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAG 300 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 40141 AGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAG 40200 

Qy 301 GAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTAAATAATAATTACAGCT 360 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 40201 GAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTAAATAATAATTACAGCT 40260 

Qy 361 ACCATTTGCATACTTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTTATAAAA 420 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 40261 ACCATTTGCATACTTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTTATAAAA 40320 


Qy 
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ATAGCAAACATTTATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTTAACCCT 


480 






llllllllllllllllllllllllllllllllllllllllilllllllllllllllllll 




Db 


40321 


ATAGCAAACATTTATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTTAACCCT 


40380 


Qy 
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CATGACACCTGTAAGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAAACTGAG 


540 






llllllllllllllillllllillllllllilllllllllllllllllllilllllllll 




Db 


40381 


CATGACACCTGTAAGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAAACTGAG 


40440 



Qy 541 GCACAAAGAATGTAAATAACTTTCCTAAGGCCACCCAGATAATAAGTGACAGAGCTGTGA 600 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
Db 40441 GCACAAAGAATGTAAATAACTTTCCTAAGGCCACCCAGATAATAAGTGACAGAGCTGTGA 40500 

RESULT 2 

US-09-925-065A-649342 

Sequence 649342, Application US/09925065A 
Publication No. US20040181048A1 
GENERAL INFORMATION: 
APPLICANT: Wang, David G. 

TITLE OF INVENTION: Identification and Mapping of Single 
TITLE OF INVENTION: Nucleotide Polymorphisms in the Human Genome 
FILE REFERENCE: 108827.135 

CURRENT APPLICATION NUMBER: US/09/925,065A 
CURRENT FILING DATE: 2001-08-08 
PRIOR APPLICATION NUMBER: US 60/243,096 
PRIOR FILING DATE: 2000-10-24 
PRIOR APPLICATION NUMBER: US 60/252,147 
PRIOR FILING DATE: 2000-11-20 
PRIOR APPLICATION NUMBER: US 60/250,092 
PRIOR FILING DATE: 2000-11-30 
PRIOR APPLICATION NUMBER: US 60/261,766 
PRIOR FILING DATE: 2001-01-16 
PRIOR APPLICATION NUMBER: US 60/289,846 
PRIOR FILING DATE: 2001-05-09 
NUMBER OF SEQ ID NOS : 957086 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 649342 
LENGTH: 591 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-925-065A-649342 

Query Match 90.6%; Score 543.6; DB 4; Length 591; 

Best Local Similarity 99.8%; Pred. No. 8.3e-82; 

Matches 543; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 48 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 107 

Qy 61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAA 120 
||||||||||||||||:|||||||||||||||||||||||||||||||||||||||||||
Db 108 ATCTACTTCATTTACTYGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAA 167 

Qy 121 AAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTT 180 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 168 AAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTT 227 

Qy 181 CTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGA 240 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 228 CTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGA 287 

Qy 241 AGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAG 300 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 288 AGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAG 347 

Qy 301 GAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTAAATAATAATTACAGCT 360 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 348 GAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTAAATAATAATTACAGCT 407 

Qy 361 ACCATTTGCATACTTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTTATAAAA 420 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 408 ACCATTTGCATACTTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTTATAAAA 467 

Qy 421 ATAGCAAACATTTATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTTAACCCT 480 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 468 ATAGCAAACATTTATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTTAACCCT 527 

Qy 481 CATGACACCTGTAAGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAAACTGAG 540 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 528 CATGACACCTGTAAGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAAACTGAG 587 

Qy 541 GCAC 544 
||||
Db 588 GCAC 591 



RESULT 3 

US-09-925-065A-649342 

Sequence 649342, Application US/09925065A 
Publication No. US20050228172A9 
GENERAL INFORMATION: 
APPLICANT: Wang, David G. 

TITLE OF INVENTION: Identification and Mapping of Single 
TITLE OF INVENTION: Nucleotide Polymorphisms in the Human Genome 
FILE REFERENCE: 108827.135 

CURRENT APPLICATION NUMBER: US/09/925,065A 
CURRENT FILING DATE: 2001-08-08 
PRIOR APPLICATION NUMBER: US 60/243,096 
PRIOR FILING DATE: 2000-10-24 
PRIOR APPLICATION NUMBER: US 60/252,147 
PRIOR FILING DATE: 2000-11-20 
PRIOR APPLICATION NUMBER: US 60/250,092 
PRIOR FILING DATE: 2000-11-30 
PRIOR APPLICATION NUMBER: US 60/261,766 
PRIOR FILING DATE: 2001-01-16 
PRIOR APPLICATION NUMBER: US 60/289,846 
PRIOR FILING DATE: 2001-05-09 
NUMBER OF SEQ ID NOS : 957086 
SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 649342 
LENGTH: 591 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-09-925-065A-649342 

Query Match 90.6%; Score 543.6; DB 5; Length 591; 

Best Local Similarity 99.8%; Pred. No. 8.3e-82; 

Matches 543; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 48 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 107 

Qy 61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAA 120 
||||||||||||||||:|||||||||||||||||||||||||||||||||||||||||||
Db 108 ATCTACTTCATTTACTYGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAA 167 

Qy 121 AAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTT 180 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 168 AAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTT 227 

Qy 181 CTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGA 240 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 228 CTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGA 287 

Qy 241 AGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAG 300 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 288 AGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAG 347 

Qy 301 GAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTAAATAATAATTACAGCT 360 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 348 GAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTAAATAATAATTACAGCT 407 

Qy 361 ACCATTTGCATACTTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTTATAAAA 420 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 408 ACCATTTGCATACTTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTTATAAAA 467 

Qy 421 ATAGCAAACATTTATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTTAACCCT 480 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 468 ATAGCAAACATTTATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTTAACCCT 527 

Qy 481 CATGACACCTGTAAGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAAACTGAG 540 
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 528 CATGACACCTGTAAGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAAACTGAG 587 

Qy 541 GCAC 544 
||||
Db 588 GCAC 591 



RESULT 4 

US-10-840-060-266/C 

Sequence 266, Application US/10840060 
Publication No^^^S 20^5^2.72 43'Al 
GENERAL INFORMATION: 
APPLICANT: Cyclacel Limited 
APPLICANT: Deak, Peter 
APPLICANT: Frenz, Lisa 
APPLICANT: Glover, David 
APPLICANT: Midgley, Carol 

TITLE OF INVENTION: Cell Cycle Progression Proteins 
FILE REFERENCE: 10069/2012 

CURRENT APPLICATION NUMBER: US/10/840,060 
CURRENT FILING DATE: 2004-05-05 
PRIOR APPLICATION NUMBER: PCT/GB02/04780 
PRIOR FILING DATE: 2002-10-23 
PRIOR APPLICATION NUMBER: GB 0126506.5 



PRIOR FILING DATE: 2001-11-05 
PRIOR APPLICATION NUMBER: GB 0128384.5 
PRIOR FILING DATE: 2001-11-27 
PRIOR APPLICATION NUMBER: GB 0203185.4 
PRIOR FILING DATE: 2002-02-11 
NUMBER OF SEQ ID NOS : 306 
SOFTWARE: PatentIn version 3.1 
SEQ ID NO 266 
LENGTH: 3046 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-840-060-266 

Query Match 16.3%; Score 98; DB 10; Length 3046; 

Best Local Similarity 100.0%; Pred. No. 1e-06; 

Matches 98; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60 

lllllllillllllllllllllllllllllllilllllllllllllllllllllllllll 

Db 1097 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 1038 

Qy 61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98 

llllllllllllllllllllllllllllllllllllll 
Db 1037 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 1000 



RESULT 5 

US-10-722-939-5/C 

Sequence 5, Application US/10722939 
Publication No. US20050214771A1 
GENERAL INFORMATION: 
APPLICANT: ROTH, RICHARD B. 
APPLICANT: NELSON, MATTHEW ROBERTS 
APPLICANT: KAMMERER, STEFAN M. 
APPLICANT: BRAUN, ANDREAS 
APPLICANT: RENELAND, RIKARD 

TITLE OF INVENTION: METHODS FOR IDENTIFYING RISK OF BREAST CANCER AND 
TREATMENTS 

TITLE OF INVENTION: THEREOF 
FILE REFERENCE: SEQ-4071-UT 
CURRENT APPLICATION NUMBER: US/10/722,939 
CURRENT FILING DATE: 2003-11-26 
PRIOR APPLICATION NUMBER: 60/429,136 
PRIOR FILING DATE: 2002-11-25 
PRIOR APPLICATION NUMBER: 60/490,234 
PRIOR FILING DATE: 2003-07-23 
NUMBER OF SEQ ID NOS: 634 
SOFTWARE: PatentIn version 3.2 
SEQ ID NO 5 
LENGTH: 3046 
TYPE: DNA 

ORGANISM: Homo sapiens 
US-10-722-939-5 



Query Match 16.3%; Score 97.6; DB 10; Length 3046; 

Best Local Similarity 99.0%; Pred. No. 1.2e-06; 

Matches 97; Conservative 1; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60 
Db 1097 

Qy 61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98 
Db 1037 




RESULT 6 

US-10-106-698-1545/c 

; Sequence 1545, Application US/10106698 

; Publication No. US20030109690A1 

; GENERAL INFORMATION: 

; APPLICANT: Ruben et al. 

TITLE OF INVENTION: Colon and Colon Cancer Associated Polynucleotides and 
Polypeptides 

; FILE REFERENCE: PA005P1 

; CURRENT APPLICATION NUMBER: US/10/106,698 

; CURRENT FILING DATE: 2002-03-27 

; PRIOR APPLICATION NUMBER: PCT/US00/26524 

; PRIOR FILING DATE: 2000-09-28 

; PRIOR APPLICATION NUMBER: US 60/157,137 

; PRIOR FILING DATE: 1999-09-29 

; PRIOR APPLICATION NUMBER: US 60/163,280 

; PRIOR FILING DATE: 1999-11-03 

; NUMBER OF SEQ ID NOS : 8564 

; SOFTWARE: PatentIn Ver. 3.0 

; SEQ ID NO 1545 

LENGTH: 1152 

TYPE: DNA 
; ORGANISM: Homo sapiens 
; FEATURE: 
; NAME/KEY: misc_feature 
; LOCATION: (6)..(6) 
; OTHER INFORMATION: n equals a,t,g, or c 
; NAME/KEY: misc_feature 
; LOCATION: (17)..(17) 
; OTHER INFORMATION: n equals a,t,g, or c 
; NAME/KEY: misc_feature 
; LOCATION: (64)..(64) 
; OTHER INFORMATION: n equals a,t,g, or c 
; NAME/KEY: misc_feature 
; LOCATION: (1126)..(1126) 
; OTHER INFORMATION: n equals a,t,g, or c 
; NAME/KEY: misc_feature 
; LOCATION: (1147)..(1147) 
; OTHER INFORMATION: n equals a,t,g, or c 
US-10-106-698-1545 

Query Match 16.0%; Score 96; DB 6; Length 1152; 

Best Local Similarity 98.0%; Pred. No. 1.7e-06; 

Matches 96; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy 1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60 
||||:|||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db 870 TACARTAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 811 

Qy 61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98 
|||||||||||||||| |||||||||||||||||||||
Db 810 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 773 



RESULT 7 

US-09-960-352-11234/C 

Sequence 11234, Application US/09960352 
Patent No. US20020137139A1 
GENERAL INFORMATION: 



APPLICANT: Warren, Wesley C. 
APPLICANT: Tao, Nengbing 
APPLICANT: Byatt, John C. 
APPLICANT: Mathialagan, Nagappan 
TITLE OF INVENTION: NUCLEIC ACID AND OTHER MOLECULES ASSOCIATED WITH 
LACTATION AND 

TITLE OF INVENTION: MUSCLE AND FAT DEPOSITION 
FILE REFERENCE: 16511 . 006/37-21 (10298) C 
CURRENT APPLICATION NUMBER: US/09/960,352 
CURRENT FILING DATE: 2001-09-24 
NUMBER OF SEQ ID NOS : 15112 
SEQ ID NO 11234 
LENGTH: 419 
TYPE: DNA 

ORGANISM: Bos taurus 

OTHER INFORMATION: Clone ID: 48-LIB3058-052-Q1-K1-D8 
US-09-960-352-11234 

Query Match 13.8%; Score 82.6; DB 3; Length 419; 

Best Local Similarity 58.2%; Pred. No. 0.00022; 

Matches 145; Conservative 0; Mismatches 104; Indels 0; Gaps 0; 
Qy 89 TCATTGACCCTGAGAAAACAATTAAATATTAAAAATAAAAATTAAATATAATTTAAATAT 148 

I III I II MM II I III MM I III II MM M III 

Db 274 TAATTTAAATTTAAAAAAATATAATATAAAAAAATTTTTTTTTATATTTAATAAAATTAT 215 

Qy 149 AAATTTTCCACTTTACCAATTTTTTGTTACTTCTTTTTTAAGGTAAAGAGAATTATAAAT 208 

IMIIII III llllllll I M Mill II Ml I II I III 

Db 214 AAATTTTAACTTTTTAAAATTTTTTTATTATTATTTTTAAAAAAAAAAAAAAAAAAAAAA 155 

Qy 209 AATTCTGGAGTAATTCCAGAAAACATAAATGAAGAAAGTATATCAAAAACTAATATAAAC 268 

II I II I I MM I III II Ml I I Mill M I III 

Db 154 AAAAAAAAAAAAAATAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 95 

Qy 269 AAATACAAACATTTCCCAAGGGCCAGCAAAAGGAACAAAAGAAATAGTGAAAATAATAGA 328 

III I III I M I MM II MM III I MM II I I 

Db 94 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 35 

Qy 329 TTATATAAA 337 

I I III 

Db 34 AAAAAAAAA 26 



RESULT 8 

US-10-473-126-386/C 



Sequence 386, Application US/10473126 
Publication No. US20040234973A1 
GENERAL INFORMATION: 
APPLICANT: Epigenomics AG 

TITLE OF INVENTION: Methods and nucleic acids for the analysis of 
hematopoietic cell 

TITLE OF INVENTION: proliferative disorders 
FILE REFERENCE: 

CURRENT APPLICATION NUMBER: US/10/473,126 
CURRENT FILING DATE: 2003-09-26 
NUMBER OF SEQ ID NOS : 1258 
SEQ ID NO 386 
LENGTH: 8056 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: chemically treated genomic DNA (Homo sapiens) 
US-10-473-126-386 

Query Match 12.8%; Score 76.8; DB 9; Length 8056; 

Best Local Similarity 47.5%; Pred. No. 0.0047; 

Matches 261; Conservative 0; Mismatches 287; Indels 2; Gaps 1; 
Qy 16 TTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAACATCTACTTCATTTAC 75 

II III II II I I II II I II I I II Hill 

Db 2582 TTTTTTAAAAATTTTATTTAAATTTTTAAATTTTTTAATTTATTTTATAATTTAATTTAT 2523 

Qy 76 TTGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAAAAATAAAAATTAAAT 135 

II I I Mill I I II I III I III llllll I 

Db 2522 TTTTTTTTATTTTTCATTAATATTTTAATTATTTTAAAACAAAATAATAAAAAATATATA 2463 

Qy 136 ATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTTCTTTTTTAAGGTAAA 195 

Db 2462 LxLJ.xiJJiLiL^x Uli J^ 2403 

Qy 196 GAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGAAGAAAGTATATCAAA 255 

I INI II I II I I IN INI I III II II II 

Db 2402 AAAAAATAATAAAATTTAATAAATTATTTATAAAAATAAAAAATTATATTTAAAAATTAA 2343 

Qy 256 AACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAGGAACAAAAGAAATAG 315 

I III II lllllllll III II MM II III III I 

Db 2342 ATAAAATTTATTTAAATACAAAAATT7UVTAAATTATTTTAAAAATA7U\TAAATTAAA7^ 2283 

Qy 316 TGAAAA--TAATAGATTATATAAAAATGTTAAATAATAATTACAGCTACCATTTGCATAC 373 

I UN I I III I MM Mill Mill I I I II II 

Db 2282 TTAAAATTTTTTTAATTTATTTTAAATATTAAAAAATAAATTTTTATTAT^AATTAAATTT 2223 

Qy 374 TTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTTATAAAAATAGCAAACATTT 433 

2222 -iLi-,^^^^^^^^^^ 2163 

Qy 434 ATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTTAACCCTCATGACACCTGTA 493 

Ml I I I I I I I MM II I II III 

Db 2162 TTTTTTTATTTTTTATTTTTAATAAAATAAAAAATTATTAAAATTATATTAAATAATTTA 2103 

Qy 494 AGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAAACTGAGGCACAAAGAATGT 553 

Ml Ml llllll III III III Ml III I 



Db 2102 TAATTATTATTTATAATATTTAAAATATTTTTAATAAAATTTTTTAAAAATAAAAAATAT 2043 

Qy 554 AAATAACTTT 563 

I II I I 
Db 2042 TTAAAAATAT 2033 



RESULT 9 

US-10-425-115-7873/c 

Sequence 7873, Application US/10425115 
Publication No. US20040214272A1 
GENERAL INFORMATION: 



APPLICANT: La Rosa, Thomas J. 
APPLICANT: Kovalic, David K. 
APPLICANT: Zhou, Yihua 
APPLICANT: Cao, Yongwei 

TITLE OF INVENTION: Nucleic Acid Molecules and Other Molecules Associated 
With 

TITLE OF INVENTION: Plants 
FILE REFERENCE: 38-21 (53222) B 
CURRENT APPLICATION NUMBER: US/10/425,115 
CURRENT FILING DATE: 2003-04-28 
NUMBER OF SEQ ID NOS : 369326 
SEQ ID NO 7873 
LENGTH: 542 
TYPE: DNA 
ORGANISM: Zea mays 

FEATURE : 

OTHER INFORMATION: Clone ID: MRT4577_107175C.1 
US-10-425-115-7873 

Query Match 12.8%; Score 76.6; DB 9; Length 542; 

Best Local Similarity 56.6%; Pred. No. 0.0024; 

Matches 142; Conservative 0; Mismatches 109; Indels 0; Gaps 0; 
Qy 105 AACAATTAAATATTAAAAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTAC 164 

344 JJJJj^^LL^^ 285 
Qy 165 CAATTTTTTGTTACTTCTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTC 224 

II III II IIIIIMi III I II I III M I M 

Db 284 AAAAAAAACGTTTTTTTTTTTTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 225 

Qy 225 CAGAAAACATAAATGAAGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCC 284 

I nil I Ml II III I I Mill II I III III I III I 

Db 224 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAT^AAAAAAAAAAAAAAA 165 

Qy 285 CAAGGGCCAGCAAAAGGAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTA 344 

II I MM II MM III I MM II II II Mill I 

Db 164 AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAT^AAAAAAAAAAAAAAAAAAAAAAA 105 

Qy 345 AATAATAATTA 355 

II M M II 

Db 104 AAAAAAAAATA 94 



RESULT 10 



US-10-486-706-306/C 

; Sequence 306, Application US/10486706 

; Publication No. US20050071088A1 

; GENERAL INFORMATION: 

; APPLICANT: LANDFIELD, PHILIP W. 

; APPLICANT: BLALOCK, ERIC M. 

; APPLICANT: CHEN, KUEY-CHU 

; APPLICANT: FOSTER, THOMAS C. 

; TITLE OF INVENTION: GENE EXPRESSION PROFILE BIOMARKERS AND THERAPEUTIC 
TARGETS FOR 

; TITLE OF INVENTION: BRAIN AGING AND AGE-RELATED COGNITIVE IMPAIRMENT 
; FILE REFERENCE: 50229-426 

; CURRENT APPLICATION NUMBER: US/10/486,706 

; CURRENT FILING DATE: 2004-02-13 

; PRIOR APPLICATION NUMBER: PCT/US02/25607 

; PRIOR FILING DATE: 2002-08-13 

; PRIOR APPLICATION NUMBER: US 60/311,343 

; PRIOR FILING DATE: 2001-08-13 

; NUMBER OF SEQ ID NOS : 461 

; SOFTWARE: PatentIn version 3.2 

; SEQ ID NO 306 

LENGTH: 3256 

TYPE: DNA 
; ORGANISM: Rattus norvegicus 
US-10-486-706-306 



Query Match 12.7%; Score 76.2; DB 10; Length 3256; 

Best Local Similarity 86.6%; Pred. No. 0.0047; 

Matches 84; Conservative 0; Mismatches 13; Indels 0; Gaps 0; 

Qy 2 ACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAACA 61 

II II II II lllllllllll lllllllllllllllll II lllllllllllllll 
Db 1359 ACTATCGATCCAGCTTCTTTCAATGCTTCAACTGCTTTGCTGTGGGTTACATCACGAACA 1300 



Qy 62 TCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98 

III lllllllllll I IIIIIIIIIIII MM 

Db 1299 TCTGCTTCATTTACTCTCAGTATACAGTCATTTACCC 1263 



RESULT 11 

US-10-980-571-14/C 

; Sequence 14, Application US/10980571 

; Publication No. US20050138675A1 

; GENERAL INFORMATION: 

; APPLICANT: Pfizer, Inc. 

; TITLE OF INVENTION: Method For Determining Cardiotoxicity 
; FILE REFERENCE: PC26200A 

; CURRENT APPLICATION NUMBER: US/10/980,571 

; CURRENT FILING DATE: 2004-11-02 

; PRIOR APPLICATION NUMBER: 60/516,774 

; PRIOR FILING DATE: 2003-11-03 

; NUMBER OF SEQ ID NOS: 18 

; SOFTWARE: PatentIn version 3.3 

; SEQ ID NO 14 

LENGTH: 3256 

TYPE: DNA 
; ORGANISM: Rattus norvegicus 



US-10-980-571-14 



Query Match 12.7%; Score 76.2; DB 10; Length 3256; 

Best Local Similarity 86.6%; Pred. No. 0.0047; 

Matches 84; Conservative 0; Mismatches 13; Indels 0; Gaps 0; 
Qy 2 ACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAACA 61 

II II II II lllllllllll lllllllllllllllll II IMIIMIIIIIIII 

Db 1359 ACTATCGATCCAGCTTCTTTCAATGCTTCAACTGCTTTGCTGTGGGTTACATCACGAACA 1300 

Qy 62 TCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98 

III lllllllllll I llllllllllll MM 

Db 1299 TCTGCTTCATTTACTCTCAGTATACAGTCATTTACCC 1263 



RESULT 12 

US-10-473-126-240/C 

Sequence 240, Application US/10473126 
Publication No. US20040234973A1 
GENERAL INFORMATION: 
APPLICANT: Epigenomics AG 

TITLE OF INVENTION: Methods and nucleic acids for the analysis of 
hematopoietic cell 

TITLE OF INVENTION: proliferative disorders 
FILE REFERENCE: 

CURRENT APPLICATION NUMBER: US/10/473,126 
CURRENT FILING DATE: 2003-09-26 
NUMBER OF SEQ ID NOS : 1258 
SEQ ID NO 240 
LENGTH: 8056 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: chemically treated genomic DNA (Homo sapiens) 
US-10-473-126-240 

Query Match 12.5%; Score 75.2; DB 9; Length 8056; 

Best Local Similarity 47.3%; Pred. No. 0.0088; 

Matches 260; Conservative 0; Mismatches 288; Indels 2; Gaps 1; 

Qy 16 TTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAACATCTACTTCATTTAC 75 

II Ml II II I I II II I II I I II Mill 

Db 2582 TTTTTTAAAAATTTTATTTAAATTTTTTU^TTTTTTAATTTATTTTATAATTTAATTTAT 2523 

Qy 76 TTGTAATATACAGTCATTGACCCTGAGAAAACAATTAAATATTAAAAATAAAAATTAAAT 135 

II I I MM I I I I I III I II llllll I 

Db 2522 TTTTTTTTATTTTTCGTTAATATTTTAATTATTTTAAAACGAAATAATAAAAAATATATA 2463 

Qy 136 ATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTGTTACTTCTTTTTTAAGGTAAA 195 

II I I I III II MM III I I II llllll I I 

Db 2462 AATAAAAATAAAAAAAAATTAATTTTTAAAAATAAATAAATTATTTTTTTTTTAATTTTC 2403 

Qy 196 GAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACATAAATGAAGAAAGTATATCAAA 255 

II II II II I 11 II III MM llllll MM 

Db 2402 GAAAAATAATAAAATTTAATAAATTATTTATAAAAATAAAAAATTATATTTAAAAATTAA 2343 

Qy 256 AACTAATATAAACAAATACAAACATTTCCCAAGGGCCAGCAAAAGGAACAAAAGAAATAG 315 
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US-10-722-939-l-T39977_39901-40500 
600 

1 tacaatagaccctgcttctt aataagtgacagagctgtga 600 
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DA501101 DA501101 




    6   96.4   16.1    482   1  AA831793  AA831793 oa59dl0.S
    7   96.4   16.1    531   3  BU741256  BU741256 UI-E-EJO-
    8   96.4   16.1    532   3  BM727000  BM727000 UI-E-EJO-
    9   96.4   16.1    578   9  DA315109  DA315109 DA315109
c  10   96.4   16.1    580   9  DA262508  DA262508 DA262508
c  11   96.4   16.1    842   7  BF677732  BF677732 602085430
c  12   96.4   16.1    860   2  BG164149  BG164149 602343020
c  13   96.4   16.1   2675  14  AY420526  AY420526 Pan trogl
   14   94.8   15.8    447   1  AA480937  AA480937 aa28c06.r
c  15   94.8   15.8    582   3  BP229026  BP229026 BP229026
c  16   94.8   15.8    590   3  BP211156  BP211156 BP211156
c  17   92.8   15.5    601   8  CR762761  CR762761 DKFZp469I
c  18     90   15.0    653   9  DN267047  DN267047 LIB30320
   19   87.2   14.5   1309   7  BE420736  BE420736 HWM002.A0
   20   86.8   14.5    732   4  BX668084  BX668084 BX668084
   21   84.2   14.0    794  11  AZ530415  AZ530415 ENTBXOITR
   22   83.8   14.0   1297  10  DV778934  DV778934 Hw Fat 9
   23   83.2   13.9   1101  14  CNSOOEVL  AL069706 Drosophil
c  24   82.8   13.8    867  14  CNS00CX5  AL060052 Drosophil
   25   82.6   13.8    866   5  CF289423  CF289423 AGENCOURT
c  26   82.4   13.7   1355  14  AG346348  AG346348 Mus muscu
c  27   81.8   13.6   1116   2  BM477099  BM477099 AGENCOURT
   28   81.4   13.6    464   9  CX571966  CX571966 TTE000181
c  29     81   13.5    729   8  CV486140  CV486140 AGENCOURT
c  30   80.4   13.4   1114   3  BQ422058  BQ422058 AGENCOURT
   31     80   13.3   1203  14  CNS015WU  AL106008 Drosophil
c  32   79.6   13.3    993  12  CL113812  CL113812 ISB1-59A1
c  33   79.4   13.2    839   3  BQ151187  BQ151187 NF048A07L
   34   79.2   13.2   1101  14  CNS0183Y  AL108856 Drosophil
   35     79   13.2    983   1  AJ926697  AJ926697 AJ926697
c  36     79   13.2   1015  12  CL135318  CL135318 ISB1-106F
   37   78.6   13.1    996  14  CNSOOFUH  AL071063 Drosophil
   38   78.6   13.1   1101  14  CNS003BD  AL064091 Drosophil
   39   78.4   13.1    759  14  CNS06QXV  AL411257 T7 end of
   40   78.4   13.1   1007  10  DV050335  DV050335 DAY35S 08
c  41   78.2   13.0   1231  12  CG754046  CG754046 P049-2-D0
   42   77.8   13.0   1101  14  CNS00LT2  AL078714 Drosophil
c  43   77.8   13.0   1255  14  AG346372  AG346372 Mus muscu
c  44   77.6   12.9   1007   9  DN566957  DN566957 92914460
   45   77.6   12.9   1088   1  AJ928262  AJ928262 AJ928262
c  46   77.6   12.9   1283  13  CL641341  CL641341 CH213-901
   47   77.4   12.9    658  14  CNS03NK1  AL252154 Tetraodon
   48   77.4   12.9    661  10  DR722785  DR722785 AGENCOURT
c  49   77.4   12.9    764   5  CF289381  CF289381 AGENCOURT
c  50   77.2   12.9    811   1  AL514901  AL514901 AL514901
c  51     77   12.8   1249  14  AG347294  AG347294 Mus muscu
c  52   76.8   12.8    834  14  CNS00868  AL051779 Drosophil
c  53   76.8   12.8   1101  14  CNS017V2  AL108536 Drosophil


/-t 
\^ 




76 


.8 


1 9 
J. ^ • 


Q 

. O 


J. -L ^ W 


14 


AG365432 


AG365432 Mus muscu 


c  55   76.6   12.8   1335  14  AG290024  AG290024 Mus muscu
c  56   76.4   12.7    989  11  AZ549932  AZ549932 ENTFP96TF
c  57   76.4   12.7   1155  10  DW671774  DW671774 CNB367-C0
   58   76.2   12.7    341   1  AA849677  AA849677 EST192444
   59   76.2   12.7    410   1  AI144926  AI144926 UI-R-BTO-
   60   76.2   12.7    801   8  CV485384  CV485384 AGENCOURT
   61   76.2   12.7    826   8  CV483860  CV483860 AGENCOURT
c  62     76   12.7   1101  14  CNS016LI  AL106896 Drosophil
c  63   75.8   12.6   1101  14  CNS00EO7  AL069440 Drosophil
c  64   75.8   12.6   1269  14  AG386989  AG386989 Mus muscu
c  65   75.8   12.6   1880  14  AG430399  AG430399 Mus muscu
   66   75.6   12.6    527   4  CA726232  CA726232 wetls.pkO
c  67   75.6   12.6    576  14  CNS035N7  AL228940 Tetraodon
c  68   75.6   12.6    745   1  AJ445424  AJ445424 AJ445424
c  69   75.6   12.6    990  14  CNS006OI  AL065624 Drosophil
   70   75.6   12.6   1101  14  CNS002FG  AL062437 Drosophil
c  71   75.6   12.6   1101  14  CNSOOETW  AL069847 Drosophil
c  72   75.2   12.5    897  14  CNS07ABZ  AL436389 T7 end of
c  73   74.8   12.5    947  12  CL112501  CL112501 ISB1-57D2
c  74   74.8   12.5   1244  14  AG361248  AG361248 Mus muscu
   75   74.8   12.5   1260  13  CL491610  CL491610 SAIL 559
   76   74.8   12.5   1522  12  CL128484  CL128484 ISB1-94I1
c  77   74.6   12.4    500   9  DN386720  DN386720 LIB3892-0
   78   74.6   12.4    607   8  CV490737  CV490737 AGENCOURT
c  79   74.6   12.4    891  14  CNS009JU  AL053767 Drosophil
   80   74.6   12.4   1058  12  CL077132  CL077132 rH?1fi-141
   81   74.6   12.4   1101  14  CNS0039G  AL063921 Drosophil
   82   74.6   12.4   1542  14  AG386981  AG386981 Mus muscu
   83   74.4   12.4    734  14  CNSOIOMP  AL099163 Drosophil
c  84   74.4   12.4    749  11  AQ324504  AQ324504 mgxb0018L
c  85   74.4   12.4    920   9  DN564190  DN564190 90895070
   86   74.4   12.4    945  14  CNS04D0K  AL285149 Tetraodon
c  87   74.4   12.4    946  13  CZ934132  CZ934132 251587 To
c  88   74.4   12.4   1027  14  CNS02T50  AL212733 Tetraodon
c  89   74.4   12.4   1029  14  CNSOIZGM  AL174271 Tetraodon
c  90   74.4   12.4   1260  14  AG320287  AG320287 Mus muscu
   91   74.4   12.4   1295  12  CG757539  CG757539 P052-4-E0
c  92   74.2   12.4    928  14  CNSOODKY  AL071865 Drosophil
   93   74.2   12.4    934  13  CL506649  CL506649 SAIL 770




94 


74 


.2 




4 


1 074. 






T%7.696Q'^6 QP n;30nQ 


c 


95 


74 


.2 


12 


.4 


1225 


14 


CNS0161D 


AL106171 Drosophil 


c 


96 




74 


12 


.3 


1101 


14 


CNS0145U 


AL103740 Drosophil 


c 


97 


73 


.8 


12 


.3 


998 


12 


BZ695174 


BZ695174 SP ^Ba005 


c 


98 


73 


.8 


12 


.3 


1025 


14 


CNS014J2 


AL104216 Drosophil 


c 


99 


73 


. 8 


12 


.3 


1227 


14 


AG430010 


AG4 30010 Mus muscu 




100 


73 


.6 


12 


.3 


1184 


14 


CNS04P4P 


AL3 00850 Tetraodon 



ALIGNMENTS 



RESULT 1 

BG567499/C
LOCUS BG567499 740 bp mRNA linear EST 10-APR-2001
DEFINITION 602586233F1 NIH_MGC_76 Homo sapiens cDNA clone IMAGE:4715024 5',
mRNA sequence.
ACCESSION BG567499
VERSION BG567499.1 GI:13575152
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 740)
AUTHORS NIH-MGC http://mgc.nci.nih.gov/.
TITLE National Institutes of Health, Mammalian Gene Collection (MGC)
JOURNAL Unpublished (1999)
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: CLONETECH Laboratories, Inc.
cDNA Library Preparation: CLONETECH Laboratories, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Incyte Genomics, Inc.
Clone distribution: MGC clone distribution information can be
found through the I.M.A.G.E. Consortium/LLNL at:
http://image.llnl.gov
Plate: LLCM1561 row: a column: 09
High quality sequence stop: 474.
FEATURES Location/Qualifiers
source 1..740
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:4715024"
/lab_host="DH10B (T1 phage-resistant)"
/clone_lib="NIH_MGC_76"
/note="Organ: liver; Vector: pDNR-LIB (Clontech); Site_1:
SfiI (ggccgcctcggcc); Site_2: SfiI (ggccattatggcc); 5' and
3' adaptors were used in cloning as follows: 5' adaptor
sequence: 5'-CACGGCCATTATGGCC-3' and 3' adaptor sequence:
5'-ATTCTAGAGGCCGAGGCGGCCGACATG-dT(30)BN-3' (where B = A,
C, or G and N = A, C, G, or T). Average insert size 1.85
kb (range 1.0-4.0 kb). 15/15 colonies contained inserts
by PCR. This library was enriched for full-length clones
and was constructed by Clontech Laboratories (Palo Alto,
CA). Note: this is a NIH_MGC Library."



ORIGIN 



Query Match 23.1%; Score 138.4; DB 2; Length 740;
Best Local Similarity 99.3%; Pred. No. 8.6e-14;
Matches 139; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy  461 TTTCTTGGTATTTTAACCCTCATGACACCTGTAAGCTTAATATATATTTTAATCCCTATT 520
        ||| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  452 TTTTTTGGTATTTTAACCCTCATGACACCTGTAAGCTTAATATATATTTTAATCCCTATT 393

Qy  521 TCACAGATGGAGAAACTGAGGCACAAAGAATGTAAATAACTTTCCTAAGGCCACCCAGAT 580
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  392 TCACAGATGGAGAAACTGAGGCACAAAGAATGTAAATAACTTTCCTAAGGCCACCCAGAT 333

Qy  581 AATAAGTGACAGAGCTGTGA 600
        ||||||||||||||||||||
Db  332 AATAAGTGACAGAGCTGTGA 313



RESULT 2 
CC504900
LOCUS CC504900 906 bp DNA linear GSS 17-JUN-2003
DEFINITION CH240_345A2.T7 CHORI-240 Bos taurus genomic clone CH240_345A2,
genomic survey sequence.
ACCESSION CC504900
VERSION CC504900.1 GI:31823193
KEYWORDS GSS.
SOURCE Bos taurus (cattle)
ORGANISM Bos taurus
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Laurasiatheria; Cetartiodactyla; Ruminantia;
Pecora; Bovidae; Bovinae; Bos.
REFERENCE 1 (bases 1 to 906)
AUTHORS Holt,R., Stott,J., Yang,G., Barber,S., Chiu,R., Smailus,D., Prabhu,A.-L.,
Tsai,M., Cloutier,A., Lee,D., Girn,N., Olson,T., Mayo,M.,
Butterfield,Y., Kirkpatrick,R., Liu,J., Guin,R., Chan,A.,
Mathewson,C., Wye,N., Masson,A., Brown-John,M., Jones,S.,
Schein,J., Marra,M., de Jong,P., McWilliam,S., Barris,W.,
Dalrymple,B.P. and Tellam,R.
TITLE Bovine BAC End Sequences from Library CHORI-240, PLATES 294 to 398
JOURNAL Unpublished (2003)
COMMENT Other_GSSs: CH240_345A2.TARBAC13P2
Contact: Rob Holt
Sequencing
The British Columbia Cancer Agency Genome Science Centre
600 W. 10th Ave, Vancouver, British Columbia, Canada V5Z 4E6
Tel: 604-877-6085
Fax: 604-877-6276
Email: rholt@bcgsc.ca
Clones are derived from the bovine BAC library CHORI-240
(http://www.chori.org/bacpac/bovine240.htm). For BAC library
availability, please contact Pieter de Jong (pdejong@mail.cho.org).
Clones may be purchased from BACPAC Resources
(http://www.chori.org/bacpac/ordering_information.htm). This work
was undertaken as part of the International Bovine BAC Mapping
Consortium (IBBMC) by CSIRO Livestock Industries, Australia and the
British Columbia Genome Sciences Centre, Canada.
Plate: 345 row: A column: 2
Seq primer: T7
Class: BAC ends.
FEATURES Location/Qualifiers
source 1..906
/organism="Bos taurus"
/mol_type="genomic DNA"
/strain="breed: Hereford"
/db_xref="taxon:9913"
/clone="CH240_345A2"
/sex="Male"
/cell_type="Blood"
/clone_lib="CHORI-240"
/note="Vector: pTARBAC1.3; Site_1: MboI; Site_2: MboI;
Hereford bull L1 Domino 99375; CHORI-240 Bovine BAC
library (Male) produced by Pieter de Jong"

ORIGIN



Query Match 22.8%; Score 136.8; DB 12; Length 906; 

Best Local Similarity 63.6%; Pred. No. 1.6e-13;

Matches 347; Conservative 0; Mismatches 147; Indels 52; Gaps 7; 



Qy   55 ACGAACATCTACTTCATTTACTTGTAATATACAGTCATTGACCCTGAGAAAACAATTAAA 114
IIIIIIMIM IIIIMIIII I II llllllllllllll II II III lllllll
Db    1 ACGAACATCTATTTCATTTACTCGCAAAATACAGTCATTGACTCTAAGGAAA-AATTAAA 59

Qy  115 TATTAAAAATAAAAATTAAATATAATTTAAATATAAATTTTCCACTTTACCAATTTTTTG 174
II MM III Mill II II Mill Mill IIMII IIIIMIM
Db   60 AATAAAAATGAAATATTAA AAATTTAATATCGATTTTTCACTTTGCCAATTTTTAT 115

Qy  175 TTACTTCTTTTTTAAGGTAAAGAGAATTATAAATAATTCTGGAGTAATTCCAGAAAACAT 234
II I Mill II III I I II II i III I II
Db  116 TTTCACATTTTTAAGGTTAACTACTACGATTTTTTCTGAGAAGAAACTTCATGTTAA 172

Qy  235 AAATGAAGAAAGTATATCAAAAACTAATATAAACAAATACAAACATTTCCCAAGGGCCAG 294
I I IMIIIIII III MM IIIMIIIIIIIIII lllllllll
Db  173 ATTTCAATATATCAAATACTTATAT AAATACAAACATTTCTCAAGGGCCAT 223

Qy  295 CAAAAGGAACAAAAGAAATAGTGAAAATAATAGATTATATAAAAATGTTAAATAATAATT 354
MM I M I MM MM Ml M MMM MMM MMI
Db  224 CAAAGGAAATAGTGCAAAT AATAGATTATTTTACAAAAATATTAAATGATAATG 277

Qy  355 ACAGCTACCATTTGCATACTTCCTGCATGCTTATTCTGTGTAGGCATTAAGACACACTTT 414
MMM III MM I I I Mill
Db  278 ACAGCT GTCTATGTACAGAAATTAAAATACTTT 310

Qy  415 ATAAAAATAGCAAACATTTATTTAGCACTAACCACATGCCAGGCACTTTCTTGGTATTTT 474
I Mill II Ml I llllllll Mill MM II I MM II III
Db  311 AGAAAAACAGAAAATACTTATTTAGTACTAATCACAAGCTACAAACTTCTTTTAGTGTTT 370

Qy  475 AACCCTCATGACACCTGTAAGCTTAATATATATTTTAATCCCTATTTCACAGATGGAGAA 534
I II I I II Ml I llllllllll III Mill II
Db  371 TAACCCTTACAACAACCTGTAAGGTGGATCTATCTATATCCCTATTTTACACATGGAAAA 430

Qy  535 ACTGAGGCACAAAGAATGTAAATAACTTTCCTAAGGCCACCCAGATAATAAGTGACAGAG 594
MIIIIMMIIII I IIIMMI MM II IMIIIIIIIMII I MM
Db  431 ACTGAGGCACAAAG--TTTAAATAACATTCCCAAAGCCACCCAGATAATGACCAGTAGAG 488

Qy  595 CTGTGA 600
I MM
Db  489 CAGTGA 494



RESULT 3 

H96495
LOCUS H96495 462 bp mRNA linear EST 25-NOV-1996
DEFINITION yt98h02.r1 Soares_pineal_gland_N3HPG Homo sapiens cDNA clone
IMAGE:232371 5' similar to SP:A45436 A45436
SAP90=SYNAPSE-ASSOCIATED PROTEIN - ;, mRNA sequence.
ACCESSION H96495
VERSION H96495.1 GI:1109963
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 462)
AUTHORS Hillier,L., Clark,N., Dubuque,T., Elliston,K., Hawkins,M.,
Holman,M., Hultman,M., Kucaba,T., Le,M., Lennon,G., Marra,M.,
Parsons,J., Rifkin,L., Rohlfing,T., Soares,M., Tan,F.,
Trevaskis,E., Waterston,R., Williamson,A., Wohldmann,P. and
Wilson,R.
TITLE The WashU-Merck EST Project
JOURNAL Unpublished (1995)
COMMENT Contact: Wilson RK
Washington University School of Medicine
4444 Forest Park Parkway, Box 8501, St. Louis, MO 63108
Tel: 314 286 1800
Fax: 314 286 1810
Email: est@watson.wustl.edu
High quality sequence stops: 333
Source: IMAGE Consortium, LLNL
This clone is available royalty-free through LLNL; contact the
IMAGE Consortium (info@image.llnl.gov) for further information.
Possible reversed clone: similarity on wrong strand
Insert Length: 2241 Std Error: 0.00
Seq primer: M13RP1.
FEATURES Location/Qualifiers
source 1..462
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="GDB:3862362"
/db_xref="taxon:9606"
/clone="IMAGE:232371"
/lab_host="DH10B (ampicillin resistant)"
/clone_lib="Soares_pineal_gland_N3HPG"
/note="Organ: pineal gland; Vector: pT7T3D (Pharmacia)
with a modified polylinker; Site_1: Not I; Site_2: Eco RI;
1st strand cDNA was primed with a Not I - oligo(dT) primer
[5' TGTTACCAATCTGAAGTGGGAGCGGCCGCCGTTTTTTTTTTTTTTTTTTT
3'], double-stranded cDNA was size selected, ligated to
Eco RI adapters (Pharmacia), digested with Not I and
cloned into the Not I and Eco RI sites of a modified pT7T3
vector (Pharmacia). Library constructed by Bento Soares
and M.Fatima Bonaldo."



ORIGIN 



Query Match 16.1%; Score 96.8; DB 10; Length 462;
Best Local Similarity 98.0%; Pred. No. 1.1e-06;
Matches 98; Conservative 0; Mismatches 2; Indels 0; Gaps 0;

Qy    1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  199 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 258

Qy   61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCCTG 100
        |||||||||||||||| ||||||||||||||||||||| |
Db  259 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCCGG 298



RESULT 4 
AA281170 

LOCUS AA281170 361 bp mRNA linear EST 14-AUG-1997
DEFINITION zt01b11.r1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:711837 5'
similar to TR:G558438 G558438 HOMOLOG OF DROSOPHILA DISCS LARGE
PROTEIN, ISOFORM 1. ;, mRNA sequence.
ACCESSION AA281170
VERSION AA281170.1 GI:1923851
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 361)
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
Tumor Gene Index
JOURNAL Unpublished (1997)
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
This clone is available royalty-free through LLNL; contact the
IMAGE Consortium (info@image.llnl.gov) for further information.
Possible reversed clone: similarity on wrong strand
Insert Length: 1961 Std Error: 0.00
Seq primer: -28m13 rev2 ET from Amersham
High quality sequence stop: 360.
FEATURES Location/Qualifiers
source 1..361
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:711837"
/tissue_type="germinal center B cell"
/lab_host="DH10B"
/clone_lib="NCI_CGAP_GCB1"
/note="Vector: pT7T3D-PacI; Site_1: Not I; Site_2: Eco RI;
1st strand cDNA was prepared from human tonsillar cells
enriched for germinal center B cells by flow sorting
(CD20+, IgD-), provided by Dr. Louis M. Staudt (NCI), Dr.
David Allman (NCI) and Dr. Gerald Marti (CBER). cDNA
synthesis was primed with a Not I - oligo(dT) primer
[5'-TGTTACCAATCTGAAGTGGGAGCGGCCGCCTCATTTTTTTTTTTTTTTTTT-3'
]. Double-stranded cDNA was ligated to Eco RI adaptors
(Pharmacia), digested with Not I and cloned into the Not I
and Eco RI sites of the modified pT7T3 vector. Library
went through one round of normalization, and was
constructed by Bento Soares and M. Fatima Bonaldo."

ORIGIN 

Query Match 16.1%; Score 96.4; DB 1; Length 361; 

Best Local Similarity 99.0%; Pred. No. 1.3e-06; 

Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   34 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 93

Qy   61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
        |||||||||||||||| |||||||||||||||||||||
Db   94 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 131



RESULT 5 

DA501101/C
LOCUS DA501101 362 bp mRNA linear EST 08-NOV-2005
DEFINITION DA501101 FCBBF3 Homo sapiens cDNA clone FCBBF3015237 5', mRNA
sequence.
ACCESSION DA501101
VERSION DA501101.1 GI:81184123
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 362)
AUTHORS Kimura,K., Wakamatsu,A., Suzuki,Y., Ota,T., Nishikawa,T.,
Yamashita,R., Yamamoto,J., Sekine,M., Tsuritani,K., Wakaguri,H.,
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N.,
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma,M.,
Murakawa,K., Ishida,S., Ishibashi,T., Takahashi-Fujii,A.,
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S.
TITLE Diversification of Transcriptional Modulation: Large-scale
Identification and Characterization of Putative Alternative
Promoters of Human Genes
JOURNAL Genome Res. 16 (1), 55-65 (2006)
PUBMED 16344560
COMMENT Contact: Takao Isogai
FLJ Project (HRI Team)
Helix Research Institute
2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan
Tel: 81-438-52-3975
Fax: 81-438-52-3986
Email: flj-cdna@nifty.com
NEDO human cDNA project (New Energy and Industrial Technology
Developmental Organization, Japan); cDNA library construction:
Helix Research Institute (HRI); 5'-end one pass sequencing: HRI,
Research Association for Biotechnology (RAB) and Biotechnology
Center, National Institute of Technology and Evaluation; 3'-end one
pass sequencing: RAB.
FEATURES Location/Qualifiers
source 1..362
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="FCBBF3015237"
/tissue_type="brain"
/dev_stage="fetal"
/clone_lib="FCBBF3"
/note="Vector: pME18SFL3"



ORIGIN 



Query Match 16.1%; Score 96.4; DB 9; Length 362;
Best Local Similarity 99.0%; Pred. No. 1.3e-06;
Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy    1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  132 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 73

Qy   61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
        |||||||||||||||| |||||||||||||||||||||
Db   72 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 35



RESULT 6 
AA831793 

LOCUS AA831793 482 bp mRNA linear EST 05-MAR-1998
DEFINITION oa59d10.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:1309267 3'
similar to SW:SP97_HUMAN Q12959 PRESYNAPTIC PROTEIN SAP97 mRNA
sequence.
ACCESSION AA831793
VERSION AA831793.1 GI:2904892
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 482)
AUTHORS NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
TITLE National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
Tumor Gene Index
JOURNAL Unpublished (1997)
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: Louis M. Staudt, M.D., Ph.D., David Allman,
Ph.D., Gerald Marti, M.D.
cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima
Bonaldo, Ph.D.
cDNA Library Arrayed by: Greg Lennon, Ph.D.
DNA Sequencing by: Washington University Genome Sequencing Center
Clone distribution: NCI-CGAP clone distribution information can be
found through the I.M.A.G.E. Consortium/LLNL at:
www-bio.llnl.gov/bbrp/image/image.html
Insert Length: 921 Std Error: 0.00
Seq primer: -40m13 fwd. ET from Amersham
High quality sequence stop: 467.
FEATURES Location/Qualifiers
source 1..482
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:1309267"
/tissue_type="germinal center B cell"
/lab_host="DH10B"
/clone_lib="NCI_CGAP_GCB1"
/note="Vector: pT7T3D-PacI; Site_1: Not I; Site_2: Eco RI;
1st strand cDNA was prepared from human tonsillar cells
enriched for germinal center B cells by flow sorting
(CD20+, IgD-), provided by Dr. Louis M. Staudt (NCI), Dr.
David Allman (NCI) and Dr. Gerald Marti (CBER). cDNA
synthesis was primed with a Not I - oligo(dT) primer
[5'-TGTTACCAATCTGAAGTGGGAGCGGCCGCCTCATTTTTTTTTTTTTTTTTT-3'
]. Double-stranded cDNA was ligated to Eco RI adaptors
(Pharmacia), digested with Not I and cloned into the Not I
and Eco RI sites of the modified pT7T3 vector. Library
went through one round of normalization, and was
constructed by Bento Soares and M. Fatima Bonaldo."



ORIGIN 



Query Match 16.1%; Score 96.4; DB 1; Length 482; 

Best Local Similarity 99.0%; Pred. No. 1.3e-06; 

Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   21 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 80

Qy   61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
        |||||||||||||||| |||||||||||||||||||||
Db   81 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 118



RESULT 7 
BU741256
LOCUS BU741256 531 bp mRNA linear EST 10-OCT-2002
DEFINITION UI-E-EJ0-air-d-04-0-UI.s1 UI-E-EJ0 Homo sapiens cDNA clone
UI-E-EJ0-air-d-04-0-UI 3', mRNA sequence.
ACCESSION BU741256
VERSION BU741256.1 GI:23684413
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 531)
AUTHORS Bonaldo,M.F., Lennon,G. and Soares,M.B.
TITLE Normalization and subtraction: two approaches to facilitate gene
discovery
JOURNAL Genome Res. 6 (9), 791-806 (1996)
PUBMED 8889548
COMMENT Contact: Soares, MB
Coordinated Laboratory for Computational Genomics
University of Iowa
375 Newton Road, 4156 MEBRF, Iowa City, IA 52242, USA
Tel: 319 335 8250
Fax: 319 335 9565
Email: bento-soares@uiowa.edu
Tissue Procurement: Dr. Gregg Hageman
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
Clone Distribution: Researchers may obtain clones from Research
Genetics (www.resgen.com).
Seq primer: M13 FORWARD
POLYA=Yes.
FEATURES Location/Qualifiers
source 1..531
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="UI-E-EJ0-air-d-04-0-UI"
/tissue_type="fetal eyes, lens, eye anterior segment,
optic nerve, retina, Retina Foveal and Macular, RPE and
Choroid"
/dev_stage="fetal and adult"
/lab_host="DH10B (Life Technologies) (T1 phage resistant)"
/clone_lib="UI-E-EJ0"
/note="Organ: eye; Vector: pT7T3-Pac (Pharmacia) with a
modified polylinker; Site_1: EcoR I; Site_2: Not I;
UI-E-EJ0 is a subtracted cDNA library constructed
according to Bonaldo, Lennon and Soares, Genome Research,
6:791-806, 1996. First strand cDNA synthesis was primed
with an oligo-dT primer containing a Not I site. Double
stranded cDNA was ligated to an EcoR I adaptor, digested
with Not I, and cloned directionally into pT7T3-Pac
vector. The oligonucleotide used to prime the synthesis of
first-strand cDNA contains a library tag sequence that is
located between the Not I site and the (dT)18 tail. The
sequence tags for this library are: fetal eyes,
AGAATCAAGA; lens, CGATTAGCGA; eye anterior segment,
AATGCCGCAT; optic nerve, CCATTAAGTG; retina, CCGCG; Retina
Foveal and Macular, GTCC; RPE and Choroid, ACCTA. This
library was created for the program, Gene Discovery in the
Visual System, supported by National Eye Institute (NEI).
TAG_TISSUE=human lens
TAG_LIB=UI-E-EJ0
TAG_SEQ=CGATTAGCGA"

ORIGIN 



Query Match 16.1%; Score 96.4; DB 3; Length 531; 

Best Local Similarity 99.0%; Pred. No. 1.3e-06; 

Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db   30 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 89

Qy   61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
        |||||||||||||||| |||||||||||||||||||||
Db   90 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 127



RESULT 8 

BM727000/C
LOCUS BM727000 532 bp mRNA linear EST 01-MAR-2002
DEFINITION UI-E-EJ0-air-d-04-0-UI.r1 UI-E-EJ0 Homo sapiens cDNA clone
UI-E-EJ0-air-d-04-0-UI 5', mRNA sequence.
ACCESSION BM727000
VERSION BM727000.1 GI:19048333
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 532)
AUTHORS Bonaldo,M.F., Lennon,G. and Soares,M.B.
TITLE Normalization and subtraction: two approaches to facilitate gene
discovery
JOURNAL Genome Res. 6 (9), 791-806 (1996)
PUBMED 8889548
COMMENT Contact: Soares, MB
Coordinated Laboratory for Computational Genomics
University of Iowa
375 Newton Road, 4156 MEBRF, Iowa City, IA 52242, USA
Tel: 319 335 8250
Fax: 319 335 9565
Email: bento-soares@uiowa.edu
Tissue Procurement: Dr. Gregg Hageman
cDNA Library preparation: Dr. M. Bento Soares, University of Iowa
cDNA Library Arrayed by: Dr. M. Bento Soares, University of Iowa
DNA Sequencing by: Dr. M. Bento Soares, University of Iowa
Clone Distribution: Researchers may obtain clones from Research
Genetics (www.resgen.com).
Seq primer: M13 Reverse.
FEATURES Location/Qualifiers
source 1..532
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="UI-E-EJ0-air-d-04-0-UI"
/tissue_type="fetal eyes, lens, eye anterior segment,
optic nerve, retina, Retina Foveal and Macular, RPE and
Choroid"
/dev_stage="fetal and adult"
/lab_host="DH10B (Life Technologies) (T1 phage resistant)"
/clone_lib="UI-E-EJ0"
/note="Organ: eye; Vector: pT7T3-Pac (Pharmacia) with a
modified polylinker; Site_1: EcoR I; Site_2: Not I;
UI-E-EJ0 is a subtracted cDNA library constructed
according to Bonaldo, Lennon and Soares, Genome Research,
6:791-806, 1996. First strand cDNA synthesis was primed
with an oligo-dT primer containing a Not I site. Double
stranded cDNA was ligated to an EcoR I adaptor, digested
with Not I, and cloned directionally into pT7T3-Pac
vector. The oligonucleotide used to prime the synthesis of
first-strand cDNA contains a library tag sequence that is
located between the Not I site and the (dT)18 tail. The
sequence tags for this library are: fetal eyes,
AGAATCAAGA; lens, CGATTAGCGA; eye anterior segment,
AATGCCGCAT; optic nerve, CCATTAAGTG; retina, CCGCG; Retina
Foveal and Macular, GTCC; RPE and Choroid, ACCTA. This
library was created for the program, Gene Discovery in the
Visual System, supported by National Eye Institute (NEI)."

ORIGIN 

Query Match 16.1%; Score 96.4; DB 3; Length 532; 

Best Local Similarity 99.0%; Pred. No. 1.3e-06; 

Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  493 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 434

Qy   61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
        |||||||||||||||| |||||||||||||||||||||
Db  433 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 396



RESULT 9 

DA315109/C
LOCUS DA315109 578 bp mRNA linear EST 30-OCT-2005
DEFINITION DA315109 BRHIP3 Homo sapiens cDNA clone BRHIP3005378 5', mRNA
sequence.
ACCESSION DA315109
VERSION DA315109.1 GI:78306086
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 578)
AUTHORS Kimura,K., Wakamatsu,A., Suzuki,Y., Ota,T., Nishikawa,T.,
Yamashita,R., Yamamoto,J., Sekine,M., Tsuritani,K., Wakaguri,H.,
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N.,
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma,M.,
Murakawa,K., Ishida,S., Ishibashi,T., Takahashi-Fujii,A.,
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S.
TITLE Diversification of Transcriptional Modulation: Large-scale
Identification and Characterization of Putative Alternative
Promoters of Human Genes
JOURNAL Genome Res. 16 (1), 55-65 (2006)
PUBMED 16344560
COMMENT Contact: Takao Isogai
FLJ Project (HRI Team)
Helix Research Institute
2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan
Tel: 81-438-52-3975
Fax: 81-438-52-3986
Email: flj-cdna@nifty.com
NEDO human cDNA project (New Energy and Industrial Technology
Developmental Organization, Japan); cDNA library construction:
Helix Research Institute (HRI); 5'-end one pass sequencing: HRI,
Research Association for Biotechnology (RAB) and Biotechnology
Center, National Institute of Technology and Evaluation; 3'-end one
pass sequencing: RAB.
FEATURES Location/Qualifiers
source 1..578
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="BRHIP3005378"
/tissue_type="hippocampus"
/clone_lib="BRHIP3"
/note="Vector: pME18SFL3"



ORIGIN 



Query Match 16.1%; Score 96.4; DB 9; Length 578; 

Best Local Similarity 99.0%; Pred. No. 1.3e-06; 

Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy    1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  264 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 205

Qy   61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
        |||||||||||||||| |||||||||||||||||||||
Db  204 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 167



RESULT 10 

DA262508/C
LOCUS DA262508 580 bp mRNA linear EST 03-NOV-2005
DEFINITION DA262508 BRCAN2 Homo sapiens cDNA clone BRCAN2015005 5', mRNA
sequence.
ACCESSION DA262508
VERSION DA262508.1 GI:79168489
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 580)
AUTHORS Kimura,K., Wakamatsu,A., Suzuki,Y., Ota,T., Nishikawa,T.,
Yamashita,R., Yamamoto,J., Sekine,M., Tsuritani,K., Wakaguri,H.,
Ishii,S., Sugiyama,T., Saito,K., Isono,Y., Irie,R., Kushida,N.,
Yoneyama,T., Otsuka,R., Kanda,K., Yokoi,T., Kondo,H., Wagatsuma,M.,
Murakawa,K., Ishida,S., Ishibashi,T., Takahashi-Fujii,A.,
Tanase,T., Nagai,K., Kikuchi,H., Nakai,K., Isogai,T. and Sugano,S.
TITLE Diversification of Transcriptional Modulation: Large-scale
Identification and Characterization of Putative Alternative
Promoters of Human Genes
JOURNAL Genome Res. 16 (1), 55-65 (2006)
PUBMED 16344560
COMMENT Contact: Takao Isogai
FLJ Project (HRI Team)
Helix Research Institute
2-6-7 Kazusa-Kamatari, Kisarazu, Chiba, 292-0818, Japan
Tel: 81-438-52-3975
Fax: 81-438-52-3986
Email: flj-cdna@nifty.com
NEDO human cDNA project (New Energy and Industrial Technology
Developmental Organization, Japan); cDNA library construction:
Helix Research Institute (HRI); 5'-end one pass sequencing: HRI,
Research Association for Biotechnology (RAB) and Biotechnology
Center, National Institute of Technology and Evaluation; 3'-end one
pass sequencing: RAB.
FEATURES Location/Qualifiers
source 1..580
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="BRCAN2015005"
/tissue_type="caudate nucleus"
/clone_lib="BRCAN2"
/note="Vector: pME18SFL3"



ORIGIN 



Query Match 16.1%; Score 96.4; DB 9; Length 580; 

Best Local Similarity 99.0%; Pred. No. 1.3e-06; 

Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 

Qy    1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
        ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db  218 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 159

Qy   61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
        |||||||||||||||| |||||||||||||||||||||
Db  158 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 121



RESULT 11 

BF677732/C
LOCUS BF677732 842 bp mRNA linear EST 21-DEC-2000
DEFINITION 602085430F1 NIH_MGC_83 Homo sapiens cDNA clone IMAGE:4249854 5',
mRNA sequence.
ACCESSION BF677732
VERSION BF677732.1 GI:11951627
KEYWORDS EST.
SOURCE Homo sapiens (human)
ORGANISM Homo sapiens
Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
Hominidae; Homo.
REFERENCE 1 (bases 1 to 842)
AUTHORS NIH-MGC http://mgc.nci.nih.gov/.
TITLE National Institutes of Health, Mammalian Gene Collection (MGC)
JOURNAL Unpublished (1999)
COMMENT Contact: Robert Strausberg, Ph.D.
Email: cgapbs-r@mail.nih.gov
Tissue Procurement: CLONETECH Laboratories, Inc.
cDNA Library Preparation: CLONETECH Laboratories, Inc.
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
DNA Sequencing by: Incyte Genomics, Inc.
Clone distribution: MGC clone distribution information can be
found through the I.M.A.G.E. Consortium/LLNL at:
http://image.llnl.gov
Plate: LLCM1070 row: k column: 07
High quality sequence stop: 614.
FEATURES Location/Qualifiers
source 1..842
/organism="Homo sapiens"
/mol_type="mRNA"
/db_xref="taxon:9606"
/clone="IMAGE:4249854"
/lab_host="DH10B (T1 phage-resistant)"
/clone_lib="NIH_MGC_83"
/note="Organ: prostate; Vector: pDNR-LIB (Clontech);
Site_1: SfiI (ggccgcctcggcc); Site_2: SfiI
(ggccattatggcc); 5' and 3' adaptors were used in cloning
as follows: 5' adaptor sequence: 5'-CACGGCCATTATGGCC-3'
and 3' adaptor sequence:
5'-ATTCTAGAGGCCGAGGCGGCCGACATG-dT(30)BN-3' (where B = A,
C, or G and N = A, C, G, or T). Average insert size 1.4
kb (range 0.5-4.0 kb). 14/15 colonies contained inserts
by PCR. This library was enriched for full-length clones
and was constructed by Clontech Laboratories (Palo Alto,
CA)."



ORIGIN 



Query Match 16.1%; Score 96.4; DB 7; Length 842;
Best Local Similarity 99.0%; Pred. No. 1.2e-06;
Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0;

Qy         1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       303 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 244

Qy        61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
             |||||||||||||||| |||||||||||||||||||||
Db       243 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 206



RESULT 12 

BG164149/C 

LOCUS       BG164149                 860 bp    mRNA    linear   EST 06-FEB-2001
DEFINITION  602343020F1 NIH_MGC_89 Homo sapiens cDNA clone IMAGE:4453276 5',
            mRNA sequence.
ACCESSION   BG164149
VERSION     BG164149.1  GI:12670852
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1  (bases 1 to 860)
  AUTHORS   NIH-MGC http://mgc.nci.nih.gov/.
  TITLE     National Institutes of Health, Mammalian Gene Collection (MGC)
  JOURNAL   Unpublished (1999)
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: ATCC
            cDNA Library Preparation: Life Technologies, Inc.
            cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL)
            DNA Sequencing by: Incyte Genomics, Inc.
            Clone distribution: MGC clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            http://image.llnl.gov
            Plate: LLAM10243 row: g column: 05
            High quality sequence stop: 687.
FEATURES             Location/Qualifiers
     source          1..860
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:4453276"
                     /tissue_type="hypernephroma, cell line"
                     /lab_host="DH10B (phage-resistant)"
                     /clone_lib="NIH_MGC_89"
                     /note="Organ: kidney; Vector: pCMV-SPORT6; Site_1: NotI;
                     Site_2: SalI; Cloned unidirectionally; oligo-dT primed.
                     Average insert size 1.3 kb. Library enriched for
                     full-length clones and constructed by Life Technologies.
                     Note: this is a NIH_MGC Library."
ORIGIN



Query Match 16.1%; Score 96.4; DB 2; Length 860; 

Best Local Similarity 99.0%; Pred. No. 1.2e-06; 

Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy         1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       232 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 173

Qy        61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
             |||||||||||||||| |||||||||||||||||||||
Db       172 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 135



RESULT 13 

AY420526/C 

LOCUS       AY420526                2675 bp    DNA     linear   GSS 17-DEC-2003
DEFINITION  Pan troglodytes DLG1 gene, VIRTUAL TRANSCRIPT, partial sequence,
            genomic survey sequence.
ACCESSION   AY420526
VERSION     AY420526.1  GI:39776483
KEYWORDS    GSS.
SOURCE      Pan troglodytes (chimpanzee)
  ORGANISM  Pan troglodytes
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Pan.
REFERENCE   1  (bases 1 to 2675)
  AUTHORS   Clark,A.G., Glanowski,S., Nielson,R., Thomas,P., Kejariwal,A.,
            Todd,M.A., Tanenbaum,D.M., Civello,D.R., Lu,F., Murphy,B.,
            Ferriera,S., Wang,G., Zheng,X.H., White,T.J., Sninsky,J.J.,
            Adams,M.D. and Cargill,M.
  TITLE     Inferring nonneutral evolution from human-chimp-mouse orthologous
            gene trios
  JOURNAL   Science 302 (5652), 1960-1963 (2003)
   PUBMED   14671302
REFERENCE   2  (bases 1 to 2675)
  AUTHORS   Clark,A.G., Glanowski,S., Nielson,R., Thomas,P., Kejariwal,A.,
            Todd,M.A., Tanenbaum,D.M., Civello,D.R., Lu,F., Murphy,B.,
            Ferriera,S., Wang,G., Zheng,X.H., White,T.J., Sninsky,J.J.,
            Adams,M.D. and Cargill,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-NOV-2003) Celera Genomics, 45 West Gude Drive,
            Rockville, MD 20850, USA
COMMENT     This sequence was made by sequencing genomic exons and ordering
            them based on alignment.
FEATURES             Location/Qualifiers
     source          1..2675
                     /organism="Pan troglodytes"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:9598"
     gene            <1..>2675
                     /gene="DLG1"
                     /locus_tag="HCM7253"
ORIGIN

Query Match 16.1%; Score 96.4; DB 14; Length 2675; 

Best Local Similarity 99.0%; Pred. No. 1.1e-06;

Matches 97; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy         1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
             ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db       890 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 831

Qy        61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
             |||||||||||||||| |||||||||||||||||||||
Db       830 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 793



RESULT 14 

AA480937 

LOCUS       AA480937                 447 bp    mRNA    linear   EST 14-AUG-1997
DEFINITION  aa28c06.r1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:814570 5'
            similar to TR:G558438 G558438 HOMOLOG OF DROSOPHILA DISCS LARGE
            PROTEIN, ISOFORM 1. ;, mRNA sequence.
ACCESSION   AA480937
VERSION     AA480937.1  GI:2210489
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1  (bases 1 to 447)
  AUTHORS   NCI-CGAP http://www.ncbi.nlm.nih.gov/ncicgap.
  TITLE     National Cancer Institute, Cancer Genome Anatomy Project (CGAP),
            Tumor Gene Index
  JOURNAL   Unpublished (1997)
COMMENT     Contact: Robert Strausberg, Ph.D.
            Email: cgapbs-r@mail.nih.gov
            Tissue Procurement: Louis M. Staudt, M.D., Ph.D., David Allman,
            Ph.D., Gerald Marti, M.D.
            cDNA Library Preparation: M. Bento Soares, Ph.D., M. Fatima
            Bonaldo, Ph.D.
            cDNA Library Arrayed by: Greg Lennon, Ph.D.
            DNA Sequencing by: Washington University Genome Sequencing Center
            Clone distribution: NCI-CGAP clone distribution information can be
            found through the I.M.A.G.E. Consortium/LLNL at:
            www-bio.llnl.gov/bbrp/image/image.html
            Possible reversed clone: similarity on wrong strand
            Seq primer: -28m13 rev1 ET from Amersham
            High quality sequence stop: 385.
FEATURES             Location/Qualifiers
     source          1..447
                     /organism="Homo sapiens"
                     /mol_type="mRNA"
                     /db_xref="GDB:6032940"
                     /db_xref="taxon:9606"
                     /clone="IMAGE:814570"
                     /tissue_type="germinal center B cell"
                     /lab_host="DH10B"
                     /clone_lib="NCI_CGAP_GCB1"
                     /note="Vector: pT7T3D-PacI; Site_1: Not I; Site_2: Eco RI;
                     1st strand cDNA was prepared from human tonsillar cells
                     enriched for germinal center B cells by flow sorting
                     (CD20+, IgD-), provided by Dr. Louis M. Staudt (NCI), Dr.
                     David Allman (NCI) and Dr. Gerald Marti (CBER). cDNA
                     synthesis was primed with a Not I - oligo(dT) primer
                     [5'-TGTTACCAATCTGAAGTGGGAGCGGCCGCCTCATTTTTTTTTTTTTTTTTT-3'].
                     Double-stranded cDNA was ligated to Eco RI adaptors
                     (Pharmacia), digested with Not I and cloned into the Not I
                     and Eco RI sites of the modified pT7T3 vector. Library
                     went through one round of normalization, and was
                     constructed by Bento Soares and M. Fatima Bonaldo."
ORIGIN



Query Match 15.8%; Score 94.8; DB 1; Length 447;

Best Local Similarity 98.0%; Pred. No. 2.5e-06; 

Matches 96; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 
Qy         1 TACAATAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 60
             |||| |||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db        41 TACAGTAGACCCTGCTTCTTTCAACGCTTCAACTGCTTTGCTATGTGTTACATCACGAAC 100

Qy        61 ATCTACTTCATTTACTTGTAATATACAGTCATTGACCC 98
             |||||||||||||||| |||||||||||||||||||||
Db       101 ATCTACTTCATTTACTCGTAATATACAGTCATTGACCC 138



RESULT 15 

BP229026/C 

LOCUS       BP229026                 582 bp    mRNA    linear   EST 15-SEP-2004
DEFINITION  BP229026 Sugano cDNA library, fetal brain Homo sapiens cDNA clone
            FBR01745, mRNA sequence.
ACCESSION   BP229026
VERSION     BP229026.1  GI:52101936
KEYWORDS    EST.
SOURCE      Homo sapiens (human)
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini;
            Hominidae; Homo.
REFERENCE   1  (bases 1 to 582)
  AUTHORS   Suzuki,Y., Yamashita,R., Shirota,M., Sakakibara,Y., Chiba,J.,
            Mizushima-Sugano,J., Nakai,K. and Sugano,S.
  TITLE     Sequence comparison of human and mouse genes reveals a homologous
            block structure in the promoter regions
  JOURNAL   Genome Res. 14 (9), 1711-1718 (2004)
   PUBMED   15342556
COMMENT     Contact: Yutaka Suzuki
            Department of Virology
            Institute of Medical Science, University of Tokyo
            4-6-1, Shirokanedai, Minatoku, Tokyo 108-8639, Japan
            Tel: 81-3-5449-5343



